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Preface 



The classical theory of Fourier series and integrals, as well as Laplace trans- 
forms, is of great importance for physical and technical applications, and 
its mathematical beauty makes it an interesting study for pure mathemati- 
cians as well. I have taught courses on these subjects for decades to civil 
engineering students, and also mathematics majors, and the present volume 
can be regarded as my collected experiences from this work. 

There is, of course, an unsurpassable book on Fourier analysis, the trea- 
tise by Katznelson from 1970. That book is, however, aimed at mathemat- 
ically very mature students and can hardly be used in engineering courses. 
On the other end of the scale, there are a number of more-or-less cookbook- 
styled books, where the emphasis is almost entirely on applications. I have 
felt the need for an alternative in between these extremes: a text for the 
ambitious and interested student, who on the other hand does not aspire to 
become an expert in the field. There do exist a few texts that fulfill these 
requirements (see the literature list at the end of the book), but they do 
not include all the topics I like to cover in my courses, such as Laplace 
transforms and the simplest facts about distributions. 

The reader is assumed to have studied real calculus and linear algebra 
and to be familiar with complex numbers and uniform convergence. On 
the other hand, we do not require the Lebesgue integral. Of course, this 
somewhat restricts the scope of some of the results proved in the text, but 
the reader who does master Lebesgue integrals can probably extrapolate 
the theorems. Our ambition has been to prove as much as possible within 
these restrictions. 
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Some knowledge of the simplest distributions, such as point masses and 
dipoles, is essential for applications. I have chosen to approach this mat- 
ter in two separate ways: first, in an intuitive way that may be sufficient 
for engineering students, in star-marked sections of Chapter 2 and sub- 
sequent chapters; secondly, in a more strict way, in Chapter 8, where at 
least the fundaments are given in a mathematically correct way. Only the 
one-dimensional case is treated. This is not intended to be more than the 
merest introduction, to whet the reader’s appetite. 

Acknowledgements. In my work I have, of course, been inspired by exist- 
ing literature. In particular, I want to mention a book by Arne Broman, 
Introduction to Partial Differential Equations... (Addison-Wesley, 1970), a 
compendium by Jan Petersson of the Chalmers Institute of Technology in 
Gothenburg, and also a compendium from the Royal Institute of Technol- 
ogy in Stockholm, by Jockum Aniansson, Michael Benedicks, and Karim 
Daho. I am grateful to my colleagues and friends in Uppsala. First of all 
Professor Yngve Domar, who has been my teacher and mentor, and who 
introduced me to the field. The book is dedicated to him. I am also partic- 
ularly indebted to Gunnar Berg, Christer O. Kiselman, Anders Kallstrom, 
Lars-Ake Lindahl, and Lennart Sailing. Bengt Carlsson has helped with 
ideas for the applications to control theory. The problems have been worked 
and re-worked by Jonas Bjermo and Daniel Domert. If any incorrect an- 
swers still remain, the blame is mine. 

Finally, special thanks go to three former students at Uppsala University, 
Mikael Nilsson, Matthias Palmer, and Magnus Sandberg. They used an 
early version of the text and presented me with very constructive criticism. 
This actually prompted me to pursue my work on the text, and to translate 
it into English. 



Uppsala, Sweden 
January 2003 



Anders Vretblad 
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1 

Introduction 



1.1 The classical partial differential equations 



In this introductory chapter, we give a brief survey of three main types of 
partial differential equations that occur in classical physics. We begin by 
establishing some convenient notation. 

Let be a domain (an open and connected set) in three-dimensional 
space R 3 , and let T be an open interval on the time axis. By (7 fe (fi), resp. 
C k (Q x T), we mean the set of all real- valued functions u(x,y,z), resp. 
u(x,y,z,t), with all their partial derivatives of order up to and including 
k defined and continuous in the respective regions. It is often practical to 
collect the three spatial coordinates (x, y, z) in a vector x and describe the 
functions as u(x), resp. u(x, t). By A we mean the Laplace operator 



A = V 2 := 



d 2 d 2 d 2 

dx 2 + dy 2 dz 2 ’ 



Partial derivatives will mostly be indicated by subscripts, e.g., 

_ du _ d 2 u 

" at ’ Uyx = d ^ ‘ 



The first equation to be considered is called the heat equation or the 
diffusion equation: 



1 du 
~*&t' 



(x, t) G x T. 




2 



1. Introduction 



As the name indicates, this equation describes conduction of heat in a 
homogeneous medium. The temperature at the point x at time t is given 
by -u(x, £), and a is a constant that depends on the conducting properties 
of the medium. The equation can also be used to describe various processes 
of diffusion, e.g., the diffusion of a dissolved substance in the solvent liquid, 
neutrons in a nuclear reactor, BROWNian motion, etc. 

The equation represents a category of second-order partial differential 
equations that is traditionally categorized as parabolic. Characteristically, 
these equations describe non-reversible processes, and their solutions are 
highly regular functions (of class C°°). 

In this book, we shall solve some special problems for the heat equa- 
tion. We shall be dealing with situations where the spatial variable can be 
regarded as one-dimensional: heat conduction in a homogeneous rod, com- 
pletely isolated from the exterior (except possibly at the ends of the rod). 
In this case, the equation reduces to 

1 

^XX — 9 'U't • 

a 1 

The wave equation has the form 

1 d^u 

Au= ^dt a’ MeftxT. 

where c is a constant. This equation describes vibrations in a homogeneous 
medium. The value u(x,t) is interpreted as the deviation at time t from 
the position at rest of the point with rest position given by x. 

The equation is a case of hyperbolic equations. Equations of this category 
typically describe reversible processes (the past can be deduced from the 
present and future by “reversion of time”). Sometimes it is even suitable 
to allow solutions for which the partial derivatives involved in the equation 
do not exist in the usual sense. (Think of shock waves such as the sonic 
bangs that occur when an aeroplane goes supersonic.) We shall be studying 
the one- dimensional wave equation later on in the book. This case can, for 
instance, describe the motion of a vibrating string. 

Finally we consider an equation that does not involve time. It is called 
the Laplace equation and it looks simply like this: 



A u = 0. 



It occurs in a number of physical situations: as a special case of the heat 
equation, when one considers a stationary situation, a steady state , that 
does not depend on time (so that Ut = 0); as an equation satisfied by the 
potential of a conservative force; and as an object of considerable purely 
mathematical interest. Together with the closely related POISSON equa- 
tion, Au(x) = F(x), where F is a known function, it is typical of equations 
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classified as elliptic. The solutions of the Laplace equation are very regular 
functions: not only do they have derivatives of all orders, there are even cer- 
tain possibilities to reconstruct the whole function from its local behaviour 
near a single point. (If the reader is familiar with analytic functions, this 
should come as no news in the two-dimensional case: then the solutions 
are harmonic functions that can be interpreted (locally) as real parts of 
analytic functions.) 

The names elliptic , parabolic, and hyperbolic are due to superficial sim- 
ilarities in the appearance of the differential equations and the equations 
of conics in the plane. The precise definitions of the different types are as 
follows: The unknown function is u = u(x) = u(x\, X 2 , . . . , x m ). The equa- 
tions considered are linear; i.e., they can be written as a sum of terms equal 
to a known function (which can be identically zero), where each term in 
the sum consists of a coefficient (constant or variable) times some deriva- 
tive of u , or u itself. The derivatives are of degree at most 2. By changing 
variables (possibly locally around each point in the domain), one can then 
write the equation so that no mixed derivatives occur (this is analogous to 
the diagonalization of quadratic forms). It then reduces to the form 

a\Un + a 2 U 22 H b Orm^mm + {terms containing Uj and u} = /(x), 

where Uj = du/dxj etc. If all the aj have the same sign, the equation is 
elliptic; if at least one of them is zero, the equation is parabolic; and if 
there exist a/s of opposite signs, it is hyperbolic. 

An equation can belong to different categories in different parts of the 
domain, as, for example, the Tricomi equation u xx + xu yy = 0 (where 
u = u{x,y)), which is elliptic in the right-hand half-plane and hyperbolic 
in the left-hand half-plane. Another example occurs in the study of the 
so-called velocity potential u(x , y) for planar laminary fluid flow. Consider, 
for instance, an aeroplane wing in a streaming medium. In the case of ideal 
flow one has A u = 0. Otherwise, when there is friction (air resistance), the 
equation looks something like (1— M 2 )u xx +u yy = 0, with M = v/vq, where 
v is the speed of the flowing medium and Vo is the velocity of sound in the 
medium. This equation is elliptic, with nice solutions, as long as v < v$, 
while it is hyperbolic if v > Vo and then has solutions that represent shock 
waves (sonic bangs). Something quite complicated happens when the speed 
of sound is surpassed. 



1.2 Well-posed problems 

A problem for a differential equation consists of the equation together with 
some further conditions such as initial or boundary conditions of some form. 
In order that a problem be “nice” to handle it is often desirable that it have 
certain properties: 
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1. There exists a solution to the problem. 

2. There exists only one solution (i.e., the solution is uniquely deter- 
mined) . 

3. The solution is stable , i.e., small changes in the given data give rise 
to small changes in the appearance of the solution. 

A problem having these properties (the third condition must be made 
precise in some way or other) is traditionally said to be well posed. It is, 
however, far from true that all physically relevant problems are well posed. 
The third condition, in particular, has caught the attention of mathemati- 
cians in recent years, since it has become apparent that it is often very 
hard to satisfy it. The study of these matters is part of what is popularly 
labeled chaos research. 

To satisfy the reader’s curiosity, we shall give some examples to illuminate 
the concept of well-posedness. 

Example 1.1. It can be shown that for suitably chosen functions / € C°°, 
the equation u x + u y + (x + 2 iy)u t = / has no solution u = u(x , ?/, t) at 
all (in the class of y complex- valued functions) (Hans Lewy, 1957). Thus, in 
this case, condition 1 fails. □ 

Example 1.2. A natural problem for the heat equation (in one spatial 
dimension) is this one: 

u xx (x, t) = ut(x, t), x > 0, t > 0; u(x, 0) = 0, x > 0; t/(0, t) = 0, t > 0. 

This is a mathematical model for the temperature in a semi-infinite rod, 
represented by the positive x-axis, in the situation when at time 0 the rod 
is at temperature 0, and the end point x = 0 is kept at temperature 0 the 
whole time t > 0. The obvious and intuitive solution is, of course, that the 
rod will remain at temperature 0, i.e., u(x,t) = 0 for all x > 0, t > 0. But 
the mathematical problem has additional solutions: let 

u(x,t) = ^2 e ~ X 5 x > 0, t > 0. 

It is a simple exercise in partial differentiation to show that this function 
satisfies the heat equation; it is obvious that u(0,t) = 0, and it is an 
easy exercise in limits to check that lim u(x,t) = 0. The function must be 

considered a solution of the problem, as the formulation stands. Thus, the 
problem fails to have property 2. 

The disturbing solution has a rather peculiar feature: it could be said to 
represent a certain (finite) amount of heat, locat ed at the end point of the 
rod at time 0. The value of u(y/2t,t) is y/(2/e)/t, which tends to -foo as 
t\0. One way of excluding it as a solution is adding some condition to 
the formulation of the problem; as an example it is actually sufficient to 
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demand that a solution must be bounded. (We do not prove here that this 
does solve the dilemma.) □ 

Example 1.3. A simple example of instability is exhibited by an ordinary 
differential equation such as y"(t) + y(t) = f(t) with initial conditions 
y( 0) = 1 , y f {0) = 0 . If, for example, we take f(t) = 1 , the solution is y(t) — 
1 . If we introduce a small perturbation in the right-hand member by taking 
f(t) = 1 + e cos £, where e ^ 0 , the solution is given by y(t) = 1 + \ et sin t. 
As time goes by, this expression will oscillate with increasing amplitude 
and “explode” . The phenomenon is called resonance. □ 



1.3 The one-dimensional wave equation 



We shall attempt to find all solutions of class C 2 of the one-dimensional 
wave equation 

C Uxx — 'U'tt’ 

Initially, we consider solutions defined in the open half-plane t > 0. 
Introduce new coordinates (£, 77 ), defined by 

£ = x — ct, 77 = x -f ct. 



It is an easy exercise in applying the chain rule to show that 



_ d 2 u _ d 2 u d 2 u d 2 u 
Uxx dx 2 <9£ 2 + <9£ ch? + dr} 2 

d 2 u 2 / 9 2 u d 2 u d 2 u\ 

Utt dt 2 ° \ 9£ 2 9£ dr} dr} 2 ) * 

Inserting these expressions in the equation and simplifying we obtain 



c 2 • 4 



d 2 u 
<9£ dr} 



= 0 



d_(du 
9£ \dr} 



= 0 . 



Now we can integrate step by step. First we see that du/dr} must be a 
function of only 77 , say, du/ dr} = h(r}). If xjj is an antiderivative of ft, another 
integration yields u = <p(£) + ^( 77 ), where ip is a new arbitrary function. 
Returning to the original variables (#,£), we have found that 



u(x, t) = (p(x — ct) + i/j(x + ct). 



(1.1) 



In this expression, </? and ^ are more-or-less arbitrary functions of one 
variable. If the solution u really is supposed to be of class C 2 , we must 
demand that ip and -0 have continuous second derivatives. 

It is illuminating to take a closer look at the significance of the two terms 
in the solution. First, assume that 0(s) = 0 for all 8 , so that u(x,t) = 
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FIGURE 1.2. 



(p(x — ct). For t = 0, the graph of the function x u(x, 0) looks just like 
the graph of (p itself. At a later moment, the graph of x u(x,t ) will 
have the same shape as that of </?, but it is pushed ct units of length to the 
right. Thus, the term p(x — ct) represents a wave moving to the right along 
the x-axis with constant speed equal to c. See Figure 1.1! In an analogous 
manner, the term xp(x + ct) describes a wave moving to the left with the 
same speed. The general solution of the one-dimensional wave equation 
thus consists of a superposition of two waves, moving along the x-axis in 
opposite directions. 

The lines x±ct = constant, passing through the half-plane t > 0, consti- 
tute a net of level curves for the two terms in the solution. These lines are 
called the characteristic curves or simply characteristics of the equation. 
If, instead of the half-plane, we study solutions in some other region Z), the 
derivation of the general solution works in the same way as above, as long 
as the characteristics run unbroken through D. In a region such as that 
shown in Figure 1.2, the function ip need not take on the same value on the 
two indicated sections that do lie on the same line but are not connected 
inside D. In such a case, the general solution must be described in a more 
complicated way. But if the region is convex , the formula (1.1) gives the 
general solution. 
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Remark. In a way, the general behavior of the solution is similar also in higher 
spatial dimensions. For example, the two-dimensional wave equation 



dx 2 ^ dy 2 



u 

c 2 dt 2 



has solutions that represent wave-shapes passing the plane in all directions, and 
the general solution can be seen as a sort of superposition of such solutions. But 
here the directions are infinite in number, and there are both planar and circular 
wave- fronts to consider. The superposition cannot be realized as a sum — one 
has to use integrals. It is, however, usually of little interest to exhibit the general 
solution of the equation. It is much more valuable to be able to pick out some 
particular solution that is of importance for a concrete situation. □ 

Let us now solve a natural initial value problem for the wave equation 
in one spatial dimension. Let f(x ) and g(x) be given functions on R. We 
want to find all functions u(x,t) that satisfy 



(p) 



c 2 u xx = u t t , —oo < x < oo, t > 0; 

u(x,0) = f(x), ut(x,0) = g(x), — oo < x < oo. 



(The initial conditions assert that we know the shape of the solution at 
t = 0, and also its rate of change at the same time.) By our previous 
calculations, we know that the solution must have the form (1.1), and so 
our task is to determine the functions (p and ^ so that 

f(x) = u(x, 0) = <p(x)+^(x), g(x) = u t (x, 0) = -c(p , (x)+crp , (x). (1.2) 

An antiderivative of g is given by G(x) = g(y) dy , and the second formula 
can then be integrated to 



-p(x) + ^(x) = - G(x) + K , 
c 

where K is the integration constant. Combining this with the first formula 
of (1.2), we can solve for p and 



¥>(*) = \ (/(*) - \ G{x) - k ) , 

Substitution now gives 

u(x, t) = p(x — ct ) + ^(x + ct ) 



^(*) = \ (/(*) + \ G{x) + tf) . 



= g 1 /(* - ct ) 



■ G{x — ct) — K + f(x + ct) + - G(x + ct) + K 



f(x — ct) + f(x + ct) G(x + ct) — G(x — ct) 

r\ 



2 c 



/( X -ct) + f(x + ct) 



x+ct 



+ 



h / 9{y)dy • 



x—ct 



(1.3) 
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The final result is called d’Alembert’s formula. It is something as rare 
as an explicit (and unique) solution of a problem for a partial differential 
equation. 

Remark. If we want to compute the value of the solution u(x , t) at a particular 
point (xo,to), d’Alembert’s formula tells us that it is sufficient to know the initial 
values on the interval [xq — cto , xo + cto]: this is again a manifestation of the fact 
that the “waves” propagate with speed c. Conversely, the initial values taken on 
[xo — cto, xo -fcto] are sufficient to determine the solution in the isosceles triangle 
with base equal to this interval and having its other sides along characteristics. 
See Figure 1.3. □ 

In a similar way one can solve suitably formulated problems in other 
regions. We give an example for a semi-infinite spatial interval. 

Example 1.4. Find all solutions u(x, t ) of u xx = u u for x > 0, t > 0, that 
satisfy u(x, 0) = 2x and u t (x, 0) = 1 for x > 0 and, in addition, u(0, t) = 2 t 
for t > 0. 

Solution. Since the first quadrant of the xt - plane is convex, all solutions of 
the equation must have the appearance 

u(x, t ) = (p(x — t) + ip(x + t), x > 0, t > 0. 

Our task is to determine what the functions ip and ^ look like. We need 
information about ^(s) when s is a positive number, and we must find out 
what <p(s) is for all real s. 

If t = 0 we get 2x = u(x , 0) = <p(x) + ^(x) and 1 = ut(x, 0) = —(f'{x) + 
^'(x); and for x = 0 we must have 2 1 = <p(—t) -\-^(t). To liberate ourselves 
from the magic of letters, we neutralize the name of the variable and call 
it s. The three conditions then look like this, collected together: 

25= (f{s) +^{s) 

< 1 = — <p f (s) + 'ip'(s) s > 0. 

^2 s= (p(-s ) +i/>(s) 
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The second condition can be integrated to — <p(s) + 'i)j(s) = s + C, and 
combining this with the first condition we get 

<p(s) = \ s - \ C, %/)( s ) = \s+\C for 8 > 0. 

The third condition then yields ^p(-s) = 2 s — 'ifj(s) = | s — | C, s > 0, 
where we switch the sign of s to get 

ip(s) = — | s - \ C for 5 < 0. 

Now we put the solution together: 



u(x, t) = ip(x — t) + rf(x + t) = 



\{x — t) + |(x + 1) = 2x + 1, x > t > 0, 
— x) + |(x + 1) = x + 2t, 0 < x < t. 



Evidently, there is just one solution of the given problem. 

A closer look shows that this function is continuous along the line x = £, 
but it is in fact not differentiable there. It represents an “angular” wave. 
It seems a trifle fastidious to reject it as a solution of the wave equation, 
just because it is not of class C 2 . One way to solve this conflict is furnished 
by the theory of distributions , which generalizes the notion of functions in 
such a way that even “angular” functions are assigned a sort of derivative. 

□ 



Exercise 

2 | 

1.1 Find the solution of the problem (P), when f(x) = e~ x , g(x) = — - — - • 

1.4 Fourier’s method 

We shall give a sketch of an idea that was tried by Jean-Baptiste Joseph 
Fourier in his famous treatise of 1822, Theorie analytique de la chaleur. 
It constitutes an attempt at solving a problem for the one-dimensional 
heat equation. If the physical units for heat conductivity, etc., are suitably 
chosen, this equation can be written as 

^ XX = , 

where u = u(x, t) is the temperature at the point x on a thin rod at time 
t. We assume the rod to be isolated from its surroundings, so that no 
exchange of heat takes place, except possibly at the ends of the rod. Let 
us now assume the length of the rod to be 7r, so that it can be identified 
with the interval [0, n] of the x-axis. In the situation considered by Fourier, 
both ends of the rod are kept at temperature 0 from the moment when 
t = 0, and the temperature of the rod at the initial moment is assumed to 
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be equal to a known function f{x). It is then physically reasonable that we 
should be able to find the temperature u(x , t) at any point x and at any 
time t > 0. The problem can be summarized thus: 

r (E) u xx = u t , 0 < x < it, t > 0; 

< (B) -u(0, t) = u(n, t) = 0, t > 0; (1.4) 

(I) u(x, 0) = f(x), 0 < X < 7T. 

The letters on the left stand for equation , boundary conditions, and ini- 

tial condition , respectively. The conditions (E) and (B) share a specific 
property: if they are satisfied by two functions u and v, then all linear 
combinations au + /3v of them also satisfy the same conditions. This prop- 
erty is traditionally expressed by saying that the conditions (E) and (B) 
are homogeneous. Fourier’s idea was to try to find solutions to the partial 
problem consisting of just these conditions, disregarding (I) for a while. 

It is evident that the function u(x,t) = 0 for all (x,t) is a solution of 
the homogeneous conditions. It is regarded as a trivial and uninteresting 
solution. Let us instead look for solutions that are not identically zero. 
Fourier chose, possibly for no other reason than the fact that it turned out 
to be fruitful, to look for solutions having the particular form u{x,t) = 
X(x) T(t ), where the functions X(x) and T(t) depend each on just one of 
the variables. 

Substituting this expression for u into the equation (E), we get 
X"(x) T{t) = X(x) T\t ), 0 < x < 7T, t> 0. 



If we divide this by the product X{x)T{t) (consciously ignoring the risk 
that the denominator might be zero somewhere), we get 



X"(x) _ T'(t) 
X(x) ~ T(t) ’ 



0 < X < 7T, t > 0. 



(1.5) 



This equality has a peculiar property. If we change the value of the variable 
t , this does not affect the left-hand member, which implies that the right- 
hand member must also be unchanged. But this member is a function of 
only £; it must then be constant. Similarly, if x is changed, this does not 
affect the right-hand member and thus not the left-hand member, either. 
Indeed, we get that both sides of the equality are constant for all the values 
of x and t that are being considered. This constant value we denote (by 
tradition) by —A. This means that we can split the formula (1.5) into two 
formulae, each being an ordinary differential equation: 



X"(x) + XX (x) = 0, 0 < x < tt; T'(t) + A T(t) = 0, t > 0. 



One usually says that one has separated the variables , and the whole method 
is also called the method of separation of variables. 
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We shall also include the boundary condition (B). Inserting the expres- 
sion u(x , t) = X(x) T(t), we get 

X(0) T(t) = X(tt) T(t) = 0, t > 0. 

Now if, for example, X(0) 7^ 0, this would force us to have T(t) = 0 for 
t > 0, which would give us the trivial solution u(x,t) = 0. If we want to 
find interesting solutions we must thus demand that X(0) = 0; for the same 
reason we must have X(i r) = 0. This gives rise to the following boundary 
value problem for X: 

X"(x) + XX (x) = 0, 0 < x < tt; X(0) = X(i r) = 0. (1.6) 

In order to find nontrivial solutions of this, we consider the different possible 
cases, depending on the value of A. 

A < 0: Then we can write A = — a 2 , where we can just as well assume 
that a > 0. The general solution of the differential equation is then X(x) = 
Ae ax + Be~ ax . The boundary conditions become 

f 0 = X(0) = A + B, 

\ 0 = X(i r) = Ae an + Be~ a7r . 

This can be seen as a homogeneous linear system of equations with A and 
B as unknowns and determinant e~ a7T — e air = — 2sinha7r / 0. It has thus 
a unique solution A = B = 0, but this leads to an uninteresting function 
X. 

A = 0: In this case the differential equation reduces to X n (x) = 0 with 
solutions X{x) = Ax + B, and the boundary conditions imply, as in the 
previous case, that A = B = 0, and we find no interesting solution. 

A > 0: Now let A = a; 2 , where we can assume that 00 > 0. The general 
solution is given by X(x) = A cos ujx + Bsinc«;x. The first boundary con- 
dition gives 0 = X(0) = A, which leaves us with X(x) = B sin ujx. The 
second boundary condition then gives 

0 = X(i r) = Bsinu;7r. (1.7) 

If here B = 0, we are yet again left with an uninteresting solution. But, 
happily, (1.7) can hold without B having to be zero. Instead, we can arrange 
it so that uj is chosen such that sino;7r = 0, and this happens precisely if u 
is an integer. Since we assumed that uj > 0 this means that u is one of the 
numbers 1,2,3, — 

Thus we have found that the problem (1.6) has a nontrivial solution 
exactly if A has the form A = n 2 , where n is a positive integer, and then 
the solution is of the form X(x) = X n (x) = B n sinnx, where B n is a 
constant. 

For these values of A, let us also solve the problem T'(t) + A T(t) = 0 or 
T'(t) = -n 2 T(t), which has the general solution T(t) = T n (t ) = C n e~ n2t . 
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If we let B n C n = b n , we have thus arrived at the following result: The 
homogeneous problem (E)+(B) has the solutions 

u(x,t) = u n (x,t) = b n e~ n 1 sin rax, n = 1,2,3, 

Because of the homogeneity, all sums of such expressions are also solutions 
of the same problem. Thus, the homogeneous sub-problem of the original 
problem (1.4) certainly has the solutions 

N 

u(x,t) = ^6 n e -n * sin nx, (1.8) 

71 = 1 

where N is any positive integer and the b n are arbitrary real numbers. The 
great question now is the following: among all these functions, can we find 
one that satisfies the non-homogeneous condition (I): ra(x,0) = f(x) = a 
known function? 

Substitution in (1:8) gives the relation 

N 

f(x) = u(x, 0) = E b n sinnx , 0 < x < n. (1.9) 

71 = 1 

If the function / happens to be a linear combination of sine functions of 
this kind, we can consider the problem as solved. Otherwise, it is rather 
natural to pose a couple of questions: 

1. Can we permit the sum in (1.8) to consist of an infinity of terms? 

2. Is it possible to approximate a (more or less) arbitrary function / 
using sums like the one in (1.9)? 

The first of these questions can be given a partial answer using the theory 
of uniform convergence. The second question will be answered (in a rather 
positive way) later on in this book. We shall return to our heat conduction 
problem in Chapter 6. 

Exercise 

1.2 Find a solution of the problem treated in the text if the initial condition 
(I) is u(x , 0) = sin 2x + 2 sin 5x. 



Historical notes 

The partial differential equations mentioned in this section evolved during the 
eighteenth century for the description of various physical phenomena. The La- 
place operator occurs, as its name indicates, in the works of Pierre Simon de 
Laplace, French astronomer and mathematician (1749-1827). In the theory of 




Historical notes 13 



analytic functions, however, it had surely been known to Euler before it was 
giveh its name. 

The wave equation was established in the middle of the eighteenth century 
and studied by several famous matheiriaticans, such as J. L. R. d’Alembert 
(1717-83), Leonhard Euler (1707-83) and Daniel Bernoulli (1700-82). 

The heat equation came into focus at the beginning of the following century. 
The most important name in its early history is Joseph Fourier (1768-1830). 
Much of the contents of this book has its origins in the treatise Theorie analytique 
de la chaleur. We shall return to Fourier in the historical notes to Chapter 4. 




2 

Preparations 



2.1 Complex exponentials 

Complex numbers are assumed to be familiar to the reader. The set of all 
complex numbers will be denoted by C. The reader has probably come 
across complex exponentials at some occasion previously, but, to be on the 
safe side, we include a short introduction to this subject here. 

It was discovered by Euler during the eighteenth century that a close 
connection exists between the exponential function e z and the trigonomet- 
ric functions cos and sin. One way of seeing this is by considering the 
Maclaurin expansions of these functions. The exponential function can be 
described by 



z 2 z 3 z A 

+ Z+ 2f + 3!‘ + ¥ + 



■E 

n=0 



n\ 



where the series is nicely convergent for all real values of z. Euler had the 
idea of letting z be a complex number in this formula. In particular, if z is 
purely imaginary, z = iy with real y, the series can be rewritten as 



e <, =1+i9 + M + M + M + . 



2 ! 



3! 



4! 



« 2 y 3 ?/ 4 y 5 
1 + i ^2f- i ¥ + 4[ + i ll-- 



1 



+ yl _ yl . 

2! 4! 6! 



y 3 y 5 y 7 

+ i ^-V + *-7i + 




16 2. Preparations 



In the brackets we recognize the well-known expansions of cos and sin: 



, y 2 ,y 4 y 6 

co S9 =l-- + i[ -- + . 

y 3 ,y 5 r / , 
Sm!, = ’'- 3 ! + 5 ! “ 7 ! + 



Accordingly, we define 

e iy = cosy + isiny. (2.1) 

This is one of the so-called Eulerian formulae. The somewhat adventurous 
motivation through our manipulation of a series can be completely justified, 
which is best done in the context of complex analysis. For this book we shall 
be satisfied that the formula is true and can be used. 

What is more, one can define exponentials with general complex argu- 
ments: 



e x+w _ e x e %y _ e x^ C0S y _j_ isiny) if x and y are real. 



The function thus obtained obeys most of the well-known rules for the real 
exponential function. Notably, we have these rules: 



= e* + ™, 



1 

— — e 
e z 



= e 



It is also true that e z ^ 0 for all z, but it need no longer be true that 
e z >0. 



Example 2.1. e l7V = cos7r + zsin7r = — 1 + i • 0 = —1. Also, e m7r = (— l) n 
if n is an integer (positive, negative, or zero). Furthermore, e Z7r//2 = i is 
not even real. Indeed, the range of the function e z for z G C contains all 
complex numbers except 0. □ 



Example 2.2. The mod ulus of a complex number z = x + iy is defined 
as |^| = = a/# 2 + y 2 - As a consequence, 

\e z \ = \e x + iy \ = \e x • e iy \ = e x \ cosy + isiny| = e x \J cos 2 y + sin 2 y = e x . 

In particular, if z = iy is a purely imaginary number, then \e z \ = \e iy \ = 1. 

□ 



Example 2.3. Let us start from the formula e lx e iy = e^ x+y ^ and rewrite 
both sides of this, using (2.1). On the one hand we have 

e lx e iy = (cos x + i sin x) (cos y + i sin y) 

= cos x cos y — sin x sin y + i( cos x sin y + sin x cos y), 

and on the other hand, 

e i(x+y) _ cos ^ _(_ i s in(x + y). 
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If we identify the real and imaginary parts of the trigonometric expressions, 
we see that 

cos(x + y) = cos x cos y — sin x sin y , sin(x + y) = cos x sin y + sin x cos y. 

Thus the addition theorems for cos and sin are contained in a well-known 
exponential law! □ 

By changing the sign of y in (2.1) and then manipulating the formulae 
obtained, we find the following set of equations: 



e iy = cos y + % sin y 



cos y = 



e y = cos y — ism y 



oW _ 



sin y — 



These are the “complete” set of Euler’s formulae. They show how one can 
pass back and forth between trigonometric expressions and exponentials. 

Particularly in Chapters 4 and 7, but also in other chapters, we shall 
use the exponential expressions quite a lot. For this reason, the reader 
should become adept at using them by doing the exercises at the end of 
this section. If these things are quite new, the reader is also advised to find 
more exercises in textbooks where complex numbers are treated. 

Exercises 

2.1 Compute the numbers e i7r/2 , e~ i7r/4 , e 57ri/6 , e ln2 " i7r/ 6 . 

2.2 Prove that the function f(z ) = e z has period 2ni, i.e., that f(z+ 2m) = f(z) 
for all z. 

2.3 Find a formula for cos 3 1, expressed in cost, by manipulating the identity 

e 3it = 

2.4 Prove the formula sin 3 1 = | sint — \ sin3t. 

2.5 Show that if \e z \ = 1, then z is purely imaginary. 

2.6 Prove the de Moivre formula: 

(cos t -f i sin t) n = cos nt + i sin nt, n integer. 



2.2 Complex- valued functions of a real variable 

In order to perform calculus on complex- valued functions, we should define 
limits of such objects. As long as the domain of definition lies on the real 
axis, this is quite simple and straightforward. One can use similar formu- 
lations as in the all-real case, but now modulus signs stand for moduli of 
complex numbers. For example: if we state that 

lim f(t) = A, 

£—>•00 
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then we are asserting the following: for every positive number e, there exists 
a number R such that as soon as t > R we are assured that \f(t) — A\ < e. 
If we split f(t ) into real and imaginary parts, 

f{t) = u(t) + iv(t ), u(t) and v(t) real, 

the following inequalities hold: 

K*)| < 1/(01. K*)l < 1/(01; 1/(01 < l«(*)l + !«(*)!• ( 2 - 2 ) 



This should make it rather clear that convergence in a complex-valued 
setting is equivalent to the simultaneous convergence of real and imaginary 
parts. Indeed, if the latter are both small, then the complex expression 
is small; and if the complex expression is small, then both its real and 
imaginary parts must be small. In practice this means that passing to 
a limit can be done in the real and imaginary parts, which reduces the 
complex-valued situation to the real-valued case. 

Thus, if we want to define the derivative of a complex- valued function 
f(t) = u(t) + iv(t), we can go about it in two ways. Either we define 



f'(t) = lim. 
h — >-0 



f(t + h)~ f(t) 
h 



which stands for an e-8 notion involving complex numbers, or we can just 
say that 

/'(£) = u'(t) + iv f (t). (2.3) 

These definitions are indeed equivalent. The derivative of a complex- valued 
function of a real variable t exists if and only if the real and imaginary parts 
of / both have derivatives, and in this case we also have the formula (2.3). 
The following example shows the most frequent case of this, at least in this 
book. 

Example 2.4. If f(t) = e ct with a complex coefficient c = a + z/?, we can 
find the derivative, according to (2.3), like this: 

f'(t) = (e at (cos f3t + i sin /3t)) = -y- ( e at cos / 3t ) + ( e at sin / 3t ) 

(Jib (Lb (Lb 

= ae at cos / 3t — e at j3 sin / 3t + i (ae at sin / 3t + e at (3 cos (3t) 

= e at (a + i/3) (cos / 3t + i sin f3t) = ce ct . 



□ 

Similarly, integration can be defined by splitting into real and imaginary 
parts. If I is an interval, bounded or unbounded, 



J f(t) dt = J ( u(t ) + iv(t)) dt = J u(t) dt + ij v{t) dt. 




2.2 Complex- valued functions of a real variable 19 



If the interval is infinite, the convergence of the integral on the left is 
equivalent to the simultaneous convergence of the two integrals on the 
right. 

A number of familiar rules of computation for differentiation and inte- 
gration can easily be shown to hold also for complex- valued functions, with 
virtually unchanged proofs. This is true for, among others, the differentia- 
tion of products and quotients, and also for integration by parts. The chain 
rule for derivatives of composite functions also holds true for an expression 
such as /(#(£)), when g is real- valued but / may take complex values. 

Absolute convergence of improper integrals follows the same pattern. 
From (2.2) it follows, by the comparison test for generalized integrals, that 
f f is absolutely convergent if and only if f u and f v are both absolutely 
convergent. 

The fundamental theorem of calculus holds true also for integrals of com- 
plex-valued functions: 



!/„ mdt= /(x> - 



Example 2.5. Let c be a non-zero real number. To compute the integral 
of e ct over an interval [a, 6] , we can use the fact that e ct is the derivative of 
a known function, by Example 2.4: 



/ 



r „cti 



e ct dt 



t=b 



t—a 



e cb - e ca 



□ 



When estimating the size of an integral the following relation is often 
useful: 





dt. 



Here the limits a and b can be finite or infinite. This is rather trivial if / 
is real-valued, so that the integral of / can be interpreted as the difference 
of two areas; but it actually holds also when / is complex- valued. A proof 
of this runs like this: The value of f(t) dt is a complex number /, which 
can be written in polar form as \I\e ia for some angle a. Then we can write 
as follows: 




= |/| = e~ ia f f(t ) dt = I** e~ ia f(t) dt = Re T e~ ia f(t) dt 
J a J a J a 

= ( b R e{e- ia f(t)}dt< [ b \e- ia f(t)\dt= f \f{t)\dt. 

J a J a J a 



Here we used that the left-hand member is real and thus equal to its own 
real part. 
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Exercises 

. .2 

2.7 Compute the derivative of f(t) = e by separating into real and imaginary 
parts. Compare the result with that obtained by using the chain rule, as if 
everything were real. 

2.8 Show that the chain rule holds for the expression f{g(t )), where g is real- 
valued and / is complex- valued, and t is a real variable. 



2.9 Compute the integral 




where n is an arbitrary integer (positive, negative, or zero). 



2.3 Cesaro summation of series 



We shall study a method that makes it possible to assign a sort of “sum 
value” to certain divergent series. For a convergent series, the new method 
yields the ordinary sum; but, as will be seen in Chapter 4, the method is 
really valuable when studying a series which may or may not be convergent. 

Let a,k be terms (real or complex numbers), and define the partial sums 
s n and the arithmetic means o n of the partial sums like this: 



n 

s n — ^ ^ 
k = 1 






S 1 + s 2 + * * ' + s n 

n 



1 

n 



n 



J2 S k- 



(2.4) 



Lemma 2.1 

Then also 



oo 

Suppose that the series ^2 a & is convergent with the sum s. 
k= 1 



lim a n — s. 

n— >oo 



Proof. Let e > 0 be given. The assumption is that s n s as n — > oo. This 
means that there exists an integer N such that |s n — s\ < s/2 for all n > N. 
For these n we can write 



\C Tn “ S = 



\si + 5 2 H b s n - ns\ 



n 



— ~ I ($i ~ s) + • • • + (sn — s) + (sjv+i — 5) + • • • + ( s n — s ) I 
n 1 

1 v— > . . 1 ^ 1 ,1 „ 1 . n S C £ 

n ^ ^ n rt n ). n / 



k = 1 



k=N + 1 



Here, C is a non-negative constant (that does not depend on n), and so, 
if n > 2(7/£, the first term in the last member is also less than e/2. Put 
no = max(iV, 2 C/e). For all n > no we have then \a n — s | < £, which is the 
assertion. □ 
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Definition 2.1 Let s n and cr n be defined as in (2.4). We say that the 
series YlkLi i s summable according to Cesaro or Cesaro summable 

or summable (C, 1) to the value, or “sum”, s, if lim a n = s. 

n — too 



We write 

oo 

X+ = s (C, 1). 

k=i 

The lemma above states that if a series is convergent in the usual sense, 
then it is also summable (C, 1), and the Cesaro sum coincides with the 
ordinary sum. 

Example 2.6. Let ak = (— l) fe-1 , k = 1,2,3,..., which means that we 
have the series 1 — 1 + 1 — 1 + 1 — 1 + • • •. Then s n = 0 if n is even and 
s n = 1 if n is odd. The means a n are 

1 . f . |( n+1 ) n+1 VC • 

a n = - it n is even, o n = = ii n is odd. 

2 n 2n 

Thus we have o n — > \ as n — > oo. This divergent series is indeed summable 
(C, 1) with sum □ 

The reason for the notation (C, 1) is that it is possible to iterate the 

process. If the a n do not converge, we can form the means r n = (g\ H (- 

<j n )/n. If the r n converge to a number s one says that the original series is 
(C, 2)-summable to s, and so on. 

These methods can be efficient if the terms in the series have different 
signs or are complex numbers. A positive divergent series cannot be summed 
to anything but -boo, no matter how many means you try. 

Exercises 

2.10 Study the series 1 + 0 — 1 + 1 + 0 — 1 + 1 + 0 , i.e., the series 

where a 3 k+i = 1, a 3 fc +2 = 0 and a 3 k +3 = — 1. Compute the Cesaro means 
On and show that the series has the Cesaro sum | . 

2.11 The results of Example 2.6 and the previous exercise can be generalized as 

follows. Assume that the sequence of partial sums s n is periodic, i.e., that 
there is a positive integer p such that s n + p = s n for all n. Then the series 
is summable (C, 1) to the sum o = (si + S 2 H b s p )/p. Prove this! 

2.12 Show that if has a finite (C, 1) value, then 

lim ^ = 0. 

n— >oo n 

What can be said about lim ak/kl 

k-+oo 

2.13 Prove that if ak > 0 and J^a/e is (C, l)-summable, then the series is con- 
vergent in the usual sense. (Assume the contrary - what does that entail 
for a positive series?) 
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2.14 Show that the series ^ * s no ^ summable (C, 1). Also show that 

it is summable (C, 2). Show that the (C, 2) sum is equal to — |. 

2.15 Show that, if x ^ n • 27r (n 6 Z), 



oo 

\ + cos kx = 0 (C, 1). 

k=i 



2.16 Prove that 



J2 zn = r^z (C7,1) for |z| < 1, z^l. 

71 = 0 



2.4 Positive summation kernels 



In this section we prove a theorem that is useful in many situations for 
recovering the values of a function from various kinds of transforms. The 
main idea is summarized in the following formulation. 

Theorem 2.1 Let I = (—a, a) be an interval (finite or infinite). Suppose 
that {K n }™ =1 is a sequence of real-valued, Riemann-integrable functions 
defined on I, with the following properties: 

(1) K n (s)> 0. 

(2) [ K n (s)ds = 1. 

J —a 

(3) If S > 0, then lim / K n (s) ds = 0. 

n— >oo J 

5<\s\<a 

If f : I C is integrable and bounded on I and continuous for s = 0, we 
then have 

lim [ K n (s) f(s) ds = /( 0). 

n—, kx) / „ 

J —a 

Proof. Let e be a positive number. Since / is continous at the origin there 
exists a number 5 > 0 such that 



\s\<6 => \f(s)~ f(0)\<s. 

Furthermore, / is bounded on 7, i.e., there exists a number M such that 
|/(5) | < M for all s. Because of the property 2 we have 



A := f K n (s)f(s)ds-f(0)=[ K n (s) f(s) ds - /(0) f K n (s)ds 

J —a J —a J —a 

- [ K n (s)(f(s) - f(0))ds. 

J —a 
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We want to prove that A -» 0 as n -* oo. Let us estimate the absolute 
value of A, assuming that |s| < <5: 



|A| = 



J K n (s)(f(s) - /(0))ds < J K n (s) |/(s) - /(0)| ds 

-a I —a 

5 

= J K n (s)\f(s) - f(0)\ds+ J K n (s)\f(s) - f{0)\ds 

—6 5<|s|<a 

<s J K n (s)ds+ j K n (s) 2M ds < e + 2M j K n (s)ds. 



6<|s|<a 



<5<|s|<a 



The last integral tends to zero, by the assumptions, and so the second term 
of the last member is also less than e if n is large enough. This means that 
for large n we have |A| < 2e, which proves the theorem. □ 

A sequence {K n }™ =1 having the properties 1-3 is called a positive sum- 
mation kernel We illustrate with a few simple examples. 

Example 2.7. Define K n : R -» R by 



K n {s) 



n, |s| < 1/(2 n), 
0, |s| > l/(2n) 



(see Figure 2.1a). It is obvious that the conditions 1-3 are fullfilled. See 
also Exercise 2.17. □ 



Example 2.8. Let <p(s) = e~ s2 / 2 /\/2n, the density function of the normal 
probability distribution (Figure 2.1b). Define K n (s) = rup{ns). Then {K n } 
is a positive summation kernel on R (check it!). □ 



Example 2.9. The preceding example can be generalized in the following 
way: Let : R -> R be some function satisfying 'ip(s) >0 and f R 'ip(s) ds = 
1. Putting K n (s) = mp(ns), we have a positive summation kernel. □ 
The examples should help the reader to understand what is going on: a 
positive summation kernel creates a weighted mean value of the function /, 
with the weight being successively concentrated towards the point s = 0. 
If / is continuous at that point, the limit will yield precisely the value of / 
at s = 0. 

A corollary of Theorem 2.1 is the following, where we move the concen- 
tration of mass to some other point than the origin: 

Corollary 2.1 If {K n }^ =1 is a positive summation kernel on the interval 
I, So is an interior point of I, and f is continuous at s = so, then 

K n (s) f(s 0 — s)ds = f(s 0 ). 
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The proof is left as an exercise (do the change of variable so — s = u). 

Remark. The choice of the interval I is often rather unimportant. It is also easy 
to see that the condition 2 can be weakened, e.g., it suffices that the integrals of 
K n over the interval tend to 1 as n — > oo. In consequence, kernels on all of R can 
also be used on any subinterval R having the origin in its interior. □ 

Remark. The reader who is familiar with the notion of uniform continuity , can 
appreciate a sharper formulation of the corollary: if / is continuous on a compact 
interval if, the functions 



F n (t) = J K n (s)f(t - S ) ds 

will converge to f(t) uniformly on if. The proof is practically unchanged, with 
the only addition that the number 6 occuring in the proof of Theorem 2.1 can be 
chosen so that it can be used simultaneously for all values of t that are involved. 

□ 



Exercises 

2.17 Prove directly, without using the theorem, that if if n is as in Example 2.7 
and / is continuous at the origin, then lim f K n (s)f(s) ds = /( 0). 

n— ►oo 

2.18 Prove that the “roof functions” g n , defined by g n (t) = n — n 2 t for 0 < 
t < 1/n, g n (t) = 0 for t > 1/n and g n (—t ) = #n(£), make up a positive 
summation kernel. Draw pictures! 

2.19 (a) Show that K n (t) = | ne~ n ^ describes a positive summation kernel. 

(b) Suppose that / is bounded and piecewise continuous on R, and 
lim f(t) = 1, lim f(t) = 3. Show that 



lim £ [ 

n->oo 2 



f(t) dt = 



2 . 
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2.20 Show that if / is bounded on R and has a derivative /' that is also bounded 
on R and continuous at the origin, then 

lim -^= [ se~ n 6,2/2 f(s) ds = /'(0). 
n->oo V27T Jr 



2.21 Let (p be defined by p(x) = y§(x 2 — l) 2 for \x\ < 1 and p(x) = 0 otherwise. 
Let / be a function with a continuous derivative. Find the limit 



lim 

n — >oo 




n 2 ip'(nx) f(x) dx. 



2.5 The Riemann-Lebesgue lemma 



The following theorem plays a central role in Fourier Analysis. It takes 
its name from the fact that it holds even for functions that are integrable 
according to the definition of Lebesgue. We prove it for functions that are 
absolutely integrable in the Riemann sense. First, let us very briefly recall 
what this means. 

A bounded function / on a finite interval [a, b] is integrable if it can be 
approximated by Riemann sums from above and below in such a way that 
the difference of the integrals of these sums can be made as small as we 
wish. This definition is then extended to unbounded functions and infinite 
intervals by taking limits; these cases are often called improper integrals. If 
I is any interval and / is a function on I such that the (possibly improper) 
integral 

j\f{u)\du 

has a finite value, then / is said to be absolutely integrable on I. 

Theorem 2.2 (Riemann-Lebesgue lemma) Let f be absolutely inte- 
grable in the Riemann sense on a finite or infinite interval I. Then 

lim / f(u) sin Xudu = 0. 

A^oo Jj JK 



Proof. We do it in four steps. First, assume that the interval is compact, 
/ = [a, 6], and that / is constant and equal to 1 on the entire interval. Then 



pb pb 

/ / (u) sin Xu du = / sin Xudu 
J a J a 



COS Xu 



-i u—b 



= -(cos Xa — cos A 5), 
A 



which gives 




sin Xu du 




as A oo. 
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The assertion is thus true for this /. 

Now assume that / is piecewise constant , which means that I (still as- 
sumed to be compact) is subdivided into a finite number of subintervals 
h = (afc_i,ajfe), k = 1,2, ...,iV (ao = a, a^- = 6), and that /(-a) has a 
certain constant value Ck for u £ Ik- This means that we can write 

N 

/(« ) = J2c k g k (u), 

k = 1 



where = 1 on Ik and gk(u) = 0 outside of Ik- We get 

j>b N ^ N ^ » ak 

/ / (a) sin Atx du = ^ I Ck gk (u) sin Aa du = Ck / sin Aa du. 

Jd k=1 Ja fc =1 ^fc-i 

This is a sum of finitely many terms, and by the preceding case each of 
these terms tends to zero as A — > oo. Thus the assertion is true also for this 



/• 

Let now / be an arbitrary function that is Riemann integrable on I = 
[a, b]. Let e be an arbitrary positive number. By the definition of the Rie- 
mann integral, there exists a piecewise constant function g such that 

J \f{u)-g(u)\du< 

(Let g be a function whose integral is a Riemann sum of /.) Then, 



f b 

1 f (u) sin Xudu 


— 


/ (fi u ) ~ 9 ( u )) si n A u du + 


/ g(u) sin Xudu 


J a 


< 


Ja J 

f |/(u) -0(u)||sinAu|du + 


'a 

1 f b 

/ g(u) sin Ait du 






J a 

f b 


1 

1 fb 


J a 


1 




< 


/ \f(u)-g(u)\du + 
J a 


/ g(u) sin Xudu 
J a 





The last integral tends to zero as A — > oo, by the preceding case. Thus there 
is a value Ao such that this integral is less that e/2 for all A > Ao- For these 
A, the left-hand member is thus less than e, which proves the assertion. 

Finally, we no longer require that I is compact. Let e > 0 be prescribed. 
Since / is absolutely integrable, there is a compact subinterval J CL I such 
that J f ^j \f(u) \ du < e. We can write 

[ f(u) sin Xu du 

i 



< 



/ f(u) sin Xu du + / | 

Jj Ji\j 



\f(u)\du , 




2.6 *Some simple distributions 27 



where the first term tends to zero by the preceding case, and thus it is less 
than e if A is large enough; the second term is always less than e. This 
completes the proof. □ 

The intuitive content of the theorem is not hard to understand: For large 
values of | A|, the integrated function f{u) sin Xu is an amplitude-modulated 
sine function with a high frequency; its mean value over a fixed interval 
should reasonably approach zero as the frequency increases. Of course, 
the factor sin Xu in the integral can be replaced by cos Xu or the complex 
function e* An , with the same result. And, of course, we can just as well let 
A tend to — oo. 



2.6 *Some simple distributions 

In this section, we introduce, in an informal way, a sort of generalization of 
the notion of a function. (A more coherent and systematic way of defining 
these objects is given in Chapter 8.) As a motivation for this generalization, 
we begin with a few “examples.” 

Example 2.10. In Sec. 1.3 (on the wave equation) we saw difficulties in the 
usual requirement that solutions of a differential equation of order n shall 
actually have (maybe even continuous) derivatives of order n. Quite natural 
solutions are disqualified for reasons that seem more of a “bureaucratic” 
nature than physically motivated. This indicates that it would be a good 
thing to widen the notion of differentiability in one way or another. □ 

Example 2.11. Ever since the days of Newton, physicists have been 
dealing with situations where some physical entity assumes a very large 
magnitude during a very short period of time; often this is idealized so 
that the value is infinite at one point in time. A simple example is an elas- 
tic collision of two bodies, where the forces are thought of as infinite at 
the moment of impact. Nevertheless, a finite and well-defined amount of 
impulse is transferred in the collision. How is this to be treated mathemat- 
ically? □ 

Example 2.12. A situation that is mathematically analogous to the 
previous one is found in the theory of electricity. An electron is considered 
(at least in classical quantum theory) to be a point charge. This means that 
there is a certain finite amount of electric charge localized at one point in 
space. The charge density is infinite at this point, but the charge itself has 
an exact, finite value. What mathematical object describes this? □ 

Example 2.13. In Sec. 2.4 we studied positive summation kernels. These 
consisted of sequences of nonnegative functions with integral equal to 1, 
that concentrate toward a fixed point as a parameter, say, N, tends to 
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infinity, for example. Can we invent a mathematical object that can be 
interpreted as the limit of such a sequence? □ 

The problems in Examples 2.11 and 2.12 above have been addressed by 
many physicists ever since the later years of the nineteenth century by 
using the following trick. Let us assume that the independent variable is t. 
Introduce a “function” S(t) with the following properties: 

(1) S(t) > 0 for - oo < t < oo, 

(2) S(t) = 0 for t 7 ^ 0, 

/ oo 

5{t)dt = l. 

-OO 

Regrettably, there is no ordinary real- (or complex) -valued function that 
satisfies these conditions. Condition 2 irrevocably implies that the integral 
in condition 3 must be zero. Nevertheless, using formal calculations involv- 
ing the “function” 5, it was possible to arrive at results that were both 
physically meaningful and “correct.” A name that is commonly associated 
with this is P. Dirac, but he was not the only person (nor even the first 
one) to reason in this way. He has, however, given his name to the object 
5: it is often called the Dirac delta function (or the Dirac measure, or the 
Dirac distribution). 

One way of making legitimate the formal 5 calculus is to follow an idea 
that is indicated in Example 2.13. If S occurs in a formula, it is at first 
replaced by a positive summation kernel upon this we then do our 
calculations, and finally we pass to the limit. In a certain sense (which will 
be made precise in Chapter 8), it is true that S = lim Kn. 

N — >-oo 

In this section, and in certain star-marked sections in the following chap- 
ters, we shall accept the delta function and some of its relatives in an intu- 
itive way. Thus, S(t) stands for an object that acts on a continuous function 
(p according to the formula 

J d(t)<p(t) dt = ip{ 0), 

where the integral is taken over some interval that contains the origin in 
its interior. We also introduce the translates S a , which can be described 
by either 5 a (t) = 5(t — a) or f S a (t)(p(t) dt — p(a). It is essential that the 
point a, where the “pulse” appears, is located in the interior of the interval 
of integration. If the point coincides with an endpoint of the interval, the 
integral is not considered to be well-defined. 

Example 2.14. The Laplace transform of a function / is defined to be 
another function /, given by 

fi s )= f(t)e~ st dt 
Jo 




2.6 *Some simple distributions 29 



for all s such that the integral is convergent (see Chapter 3). The Laplace 
transform of S cannot be defined in this way. We can, however, modify the 
definition so as to include the origin. It is indeed customary to write 




f(t)e~ st dt 



lim 

fc/*0 



f 



f{t) e 



-st 



dt. 



With this definition one finds that 5(s) = 1 for all s. Similarly, <5 a (s) = e as , 
if a > 0. □ 

The Heaviside function, or unit step function, H is defined by 



H(t) = 



0 for t < 0, 

1 for t > 0. 



The value of H{ 0) is mostly left undefined, because it is normally of no 
importance. The Heaviside function is useful in many contexts. One of 
these is when we are dealing with functions that are given by different 
formulae in different intervals. 

If a < b, the expression H(t — a) — H(t — b) is equal to 1 for a < t < b and 
equal to 0 outside the interval [a, b\. It might be called a “window” that 
lights up the interval (a, b ) (we do not in these situations care much about 
whether an interval is open or closed). For unbounded intervals we can also 
find “windows”: the function H(t — a) lights up the interval (a, oo), and 
the expression 1 — H(t — b) the interval (— oo, b). 

Example 2.15. Consider the function / : R — > R that is given by 



m = i 



1 — t 2 for t < —2, 
t + 2 for - 2 <t < 



1, 



I 1 — t for t > 1. 



This can now be compressed into one formula: 

/(*) 

- (1 - t 2 )( 1 - H{t + 2)) + (t + 2 )(H{t + 2 )-H(t - 1)) + (1 - t)H{t - 1) 

— (1 — t + ( — 1 + 1 2 + 1 ~h 2 }H{t + 2) H - ( — t — 2 + 1 — t^H{t — 1) 

= 1 — t 2 + it 2 + 1 + 1)j H( t + 2) — (2 1 + 1).H(£ — 1). 



□ 

Heaviside’s function is connected with the S function via the formula 
H(t) = f S(u) du. 

J — OO 

A very bold differentiation of this formula would give the result 

H'(t) = 8(t). 



(2.5) 
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Since H is constant on the intervals ] — oo, 0[ and ]0, oo[, and S(t) is consid- 
ered to be zero on these intervals, the formula (2.5) is reasonable for t ^ 0. 
What is new is that the “derivative” of the jump discontinuity of H should 
be considered to be the “pulse” of 5. In fact, this assertion can be given a 
completely coherent background; this will be done in Chapter 8. 

If (f is a function in the class C 1 , i.e., it has a continuous derivative, and 
if in addition (p is zero outside some finite interval, the following calculation 
is clear: 

/ oo poo 

<p'{t)H{t) dt — <p'(t ) dt = [p(t)] ~ 0 = 0 - ¥>(0) = -¥?(0). 

-oo J 0 



The same result can also be obtained by the following formal integration 
by parts: 



/ OO f "I OO poo 

(p\t)H(t) dt= <p(t)H(t)\ — (p(t)H f (t)dt 

-oo L J — OO J — oo 

/ oo 

p(t)5(t) dt = — <^(0). 

-oo 



This is characteristic of the way in which these generalized functions can be 
treated: if they occur in an integral together with an “ordinary” function 
of sufficient regularity, this integral can be treated formally, and the results 
will be true facts. 

One can go further and introduce derivatives of the 5 functions. What 
would be, for example, the first derivative of 6 = Jo ? One way of finding out 
is by operating formally as in the preceding situation. Let (p be a function 
in C 1 , and let it be understood that all integrals are taken over an interval 
that contains 0 in its interior. Since S(t) = 0 if t ^ 0, it is reasonable that 
also 5'(t) = 0 for t ^ 0. Integration by parts gives 



/ 



6'(t)ip(t) dt = 



S(t)ip{t) 



f 



S(t)p'(t) dt = (0 — 0) — ^ ; (0) = — ^'(0). 



If S itself serves to pick out the value of a function at the origin, the 
derivative of 5 can thus be used to find the value at the same place of 
the derivative of a function. 

Another way of seeing 5 f is to consider 5 to be the limit of a differentiable 
positive summation kernel, and taking the derivative of the kernel. An 
example is actually given in Exercise 2.20. As in Example 2.8 on page 23, 
we study the summation kernel 



Kn(t) = 




-n 2 t 2 / 2 



(which consists in rescaling the normal probability density function). The 
derivatives are 

-n 2 t 2 / 2 



K n \t) = - 



nH 

7 ^' 
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These are illustrated in Figure 2.2. The fact that they approach —S'(t) is 
proved by integration by parts (which is what Exercise 2.20 is all about). 

In the theory of electricity, there occurs a phenomenon known as an 
electric dipole. This consists of two equal but opposite charges ±q at a 
small distance from each other (see Figure 2.3). If the distance is made 
smaller and charges increase in proportion to the inverse of the distance, 
the “limit object” is an idealized dipole. A mathematical model of this 
object consists of 5', just as a a point charge can be represented by 6. 

Higher derivatives of 8 can also be defined. Using integration by parts 
one finds that the nth derivative 8 ^ should act according to the formula 

1 8W(t)<p{t)dt = (- l)V n) (0), 



provided the function p has an nth derivative that is continuous at the 
origin. 
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Exercises 

2.22 Compute the following integrals (taken over the entire real axis if nothing 
else is indicated): 

(a) J(t 2 + 3t)(5(t) — S(t -f 2)) dt (b) J 0 °° e~ st 5'(t — 1) dt 
(c) / e 2t S'(t) dt (d) S M (t) e~ st dt 

2.23 What should be meant by 5(2t), expressed using S(t) ? Investigate this by 
manipulating J cp(t)5(2t) dt in a suitable way. Generalize to <5(a£), a ^ 0. 
(The cases a > 0 and a < 0 should be considered separately.) 

2.24 Rewrite, using Heaviside windows, the expressions f\{t) = t\t + 1|, / 2 (t) = 
e - ^, fs(t) = sgn t = t/\t\ ( t ^ 0), f 4 (t) = A if t < a, = B if t > a. 

2.7 * Computing with 8 

We shall now show how one can solve certain problems involving the 5 
distribution and its derivatives. 

The ordinary rules for computing with derivatives will still hold true. 
(We cannot really prove this at the present stage.) For example, the rule 
for differentiating a product is valid: (fg)' = f'g + fg f . 

Example 2.16. If x is a function that is continuous at a, what should be 
meant by the product ? Since S a (t) is “zero” except for at t = a, it 

can be expected that the values of x(t) for t ^ a should not really matter. 
And we can write as follows: 

J (x(t)6a(t))<p(t)dt = J 6 a (t)(x(t)<p(t)) dt = x(a)ip(a). 

There is no way to distinguish x(£)5 fl (£) from x( a )$a(t)- Thus we have a 
simplification rule: the product of a delta and a continuous function is equal 
to a scalar multiple of the delta, with coefficient equal to the value of the 
function at the point where the pulse sits: 

= x{a)Sa(t). (2.6) 

If we encounter derivatives of <5, the matter is more complicated. What 
happens is this: start from (2.6) and differentiate: 

x'(t)6 a {t) + x(t)K(t) = xWait)- 

In the first term we can replace x'(t) by x'( a ) an d then move this term to 
the other side. We get 

x(t)S'a(t) = x(o)^W - x'(a)$a(t). 

(On second thought, it should not be surprising that the product of a 
function and a 5 f somehow takes into account the value of the derivative 
of the function as well.) 
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What happens when the second derivative is multiplied by a function is 
left to the reader to find out (in Exercise 2.25). □ 

Example 2.17. Find the first two derivatives of /(£) = \t\. 

Solution. Rewrite the function without modulus signs, using Heaviside win- 
dows: 

f{t) = |t| = -t( 1 - H(t)) + tH(t) = 2tH(t) - t. 
Differentiation then gives 

f{t) = 2 H(t) + 2 tS(t) - 1 = 2 H(t) - 1. 

In the last step we used the formula (2.6). In plain language, the derivative 
of \t\ is plus one for positive t and minus one for negative £, just as we 
know from elementary calculus; at the origin, the value of the derivative is 
undecided. We proceed to the second derivative: 

f"(t) = 26(t)-0 = 2S{t). 

This formula reflects the fact that f' has derivative zero everywhere outside 
the origin; whereas at the origin, the delta term indicates that /' has a 
positive jump of two units. This is characteristic of the derivative of a 
function with jumps. □ 

Example 2.18. Another example of the same type, though more compli- 
cated. The function f(x) = \x 2 — 1| can be rewritten as 

f(x) = (x 2 - 1 )H(x - 1) + (1 — x 2 )(H(x + 1) — H(x — 1)) 

+(x 2 - 1)(1 - H(x + 1)) 

- (x 2 - 1) (2 H(x - 1) - 2 H(x + 1) + l) . 

This formula can be differentiated, using the rule for differentiating a prod- 
uct: 

f(x) = 2x{2 H(x — 1) - 2 H(x + 1) + l) + (x 2 — l)(25(x - 1) — 2S(x + 1)) 

= 2x{2 H(x - 1) - 2 H{x + 1) + 1). 

In the last step, we used (2.6). One more differentiation gives 

/"(:r) = 2{2H(x - 1) - 2H{x + 1) + l) + 2x(2S{x - 1) - 2 S(x + 1)) 

= 2(2H(x - 1) - 2H(x + 1) + 1) + 4 S(x - 1) + 46(x + 1). 

The first term contains the classical second derivative of \x 2 — 1|, which 
exists for x ^ ±1; the two 5 terms demonstrate that /' has upward jumps 
of size 4 for x = ±1. The reader should draw pictures of /, /', and /". □ 

In the two last examples, the first derivative at the “corners” of / is 
considered to be undecided. The classical point of view is to say that / does 
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not have a derivative at such a point; when working with distributions, the 
derivative is thought of as more of a global notion, that always exists, but 
may lack a precise value at certain points. 

Example 2.19. Solve the differential equation y f + 2y = 5(t — 1) for t > 0 
with the initial value y( 0) = 1. 

Solution. The method of integrating factor can be used. An integrating 
factor is e 2t : 

e 2 y + 2 e 2t y = e 2t S(t - l) ^ ± (e 2t y ) = eH{t - 1). 

In rewriting the right-hand side we used (2.6). Now we can integrate: 



e 2t y = e 2 H(t - 1) + (7, 



where C is a constant. To satisfy the initial condition, we must take (7 = 1. 
Thus the solution is 



y = e 2 ~ 2t H(t - 1) + e 2t , t > 0. 

(The reader is recommended to check the solution by differentiation and 
substitution into the original equation.) □ 

Example 2.20. Find all solutions of the differential equation y" + 4y = S. 

Solution. The classical method for this sort of problem amounts to first 
finding the general solution of the corresponding homogeneous equation, 
which is yu — <7icos2t + (72sin2£, where C\ and C 2 are arbitrary con- 
stants. Then we should find some particular solution of the inhomogeneous 
equation. What kind of expression yp could possibly, after differentiation 
and substitution into the left-hand side of the equation, yield the result 
S ? Apparently, something drastic happens at t = 0. Since 5(t) = 0 for 
t < 0, the equation can be said to be homogeneous during this period 
of time. Let us then guess that there is a particular solution of the form 
y P (t) = u(t)H(t ), where u(t) is to be determined. Differentiation gives 

yp(t) = u'(t)H(t) + u(t)H f (t ) = u'(t)H(t) + u(0)5(t), 
y'^(t) = u , \t)H{t)W{t)H\t)+u{0)S , {t) 

= u f \t)H(t)+u\0)5(t)+u(0)6'{t). 

Substitution into the equation gives 

(u"(t)H(t) + u'(0)5(t) + u{0)S'{t)) + 4 u(t)H(t) = 6{t) 
or 

(u”(t) + 4 u(t))H(t) + u'(0)5(t) + u(0)S'(t) = S(t). 
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The function u should be chosen so that u" + Au = 0, u'( 0) = 1 and 
u(0) = 0. This means that u(t) = a cos 2t + b sin 2£, where 0 = u(0) = a and 
1 = ti'(0) = 2b. Thus, u = \ sin2£, and yp = \ sin 2 tH(t). The solutions of 
the problem are thus 

y = Ci cos 2 1 + (C 2 + \H(t)) sin 2 t. 

Again, the reader is recommended to check the solution. □ 

Example 2.21. In Sec. 1.3, on the wave equation, the final example turned 
out to have a solution that was not really a differentiable function. Now 
we can put this right, by allowing the generalized derivatives introduced in 
this section. The solution involved the function </?, defined by 

(p(s) = \ s — \ C for s > 0, <p(s) = — \ s — \ C for s < 0. 

We can rewrite this definition, using Heaviside windows: 

¥>(*) = H s - 5 CX 1 - )) + (I * - i C)H(s) = 4 S-\C + sH(s). 

The two first derivatives are 

¥>'(«) = -§ + H(s) + s6(s) = -\ + H(s), ¥>"(s) = 6(a). 

The complete solution of the problem in Sec. 1.3 can be written 

u(x, t) — <p(x - t) + \j)(x + t) = (f(x - t) -h + 1) + \ C. 

Differentiating, and trusting that the chain rule holds as usual (which it 
does, as will be proved in Chapter 8), we find 

u x = <p'(x — *) + § = 1 + H{x - t), u xx = S(x - t) 

and 

u t = -(p'(x -t) + | = 2- H(x - t), u tt = S(x - t). 

Thus, u xx = u t t as distributions, and u can be considered as a worthy 
solution of the wave equation. □ 

Exercises 

2.25 Find a simpler expression for where x is a C 2 function. 

2.26 Determine the derivatives of order < 2 of the functions f(t) = e - ^, g(t) = 
|t|e~^ and h(t) = | sin 1 1 . Draw pictures! 

2.27 Let / : R -> R be given by f(x ) = 1 - x 2 if -1 < x < 1 and f(x) = 0 
otherwise. Find and then simplify the expression ( x 2 — 1 )f"(x) as far 
as possible. 

2.28 Find the derivatives /' and /", if /(£) = |£ 3 — t\. Sketch the graphs of /, 
f of /" in separate pictures. 
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2.29 Find the general solution of the differential equation -j- + 2 ty = 8(t — a). 

2.30 Solve the problems (a) y"—y = tH(t+ 1), (b) y" +3y' +2y = tH(t)+8'(t). 

2.31 Find y = y(x) that satisfies (1 + x 2 )y f — 2 xy = S(x — 1) and 7/(0) = 1. 

2.32 Establish the following formula for an antiderivative (F being an antideriva- 
tive of /): 

J f(t)H(t — a) dt = ( F(t ) - F(a))H(t -a) + C. 

2.33 Find a function y = y(x) such that y' + 2 xy = 2 xH(x) — S(x — 1) and 
2 /( 2 ) = 1. (Hint: the result of the preceding exercise may be useful.) 



Historical notes 

Complex numbers began to pop up as early as the Renaissance era, when scholars 
such as Cardano began solving equations of third and fourth degrees. But not 
until Leonhard Euler (1707-83) did they begin to be accepted as just as natural 
as the real numbers. The study of complex- valued functions was intensified in the 
nineteenth century; some famous names are Augustin Cauchy (1789-1857), 
Bernhard Riemann (1826-66), and Karl Weierstrass (1815-97). 

The idea of “summing” certain divergent series was made precise by math- 
ematicians such as the young Norwegian Niels Henrik Abel (1802-29) and 
Carl Friedrich Gauss (1777-1855). The method presented in Sec. 2.3 is due 
to the Italian mathematician Ernesto Cesaro (1859-1906), but the German 
Otto Holder (1859-1937) had the same idea at about the same time. 

Riemann is the originator of an integral definition which is even today in uni- 
versal use for elementary education. His definition has certain disadvantages, that 
were remedied by Henri Lebesgue (1875-1941) in his 1900 thesis. The Lebesgue 
integral is, however, even after one century considered to be too complicated to 
be included in elementary courses. 

The theory of distributions is chronicled after Chapter 8. 



Problems for Chapter 2 

2.34 Show that the function f(z) = e 4z has period ni/2. 

2.35 Let / be a continuous function on R. Assume that we know that it has 
period 2i r and that it satisfies the equation 

m = - i*) + f(t + ±tt)) for all t G R. 

Show that / in fact has a period shorter than 2i r, and determine this period. 

2.36 Let (p be a C 1 function such that </? and <// are bounded on the real axis. 
Compute the limit 

2n 3 f°° x 

iim / . <p{x)dx. 

n— »oo 7 r J (l+n 2 z 2 ) 2 
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2.37 Let F(x) = (1 — x 2 )(H(x -f 1) — H(x — 1)). Let g be continuous on the 
interval [—1,1]. Find the limit 



lim f 

n— >oo 



j: 



nF(nx) g{x) 



dx. 



2.38 

2.39 



Find the derivatives of order < 4 of f(t) = t 2 H(t). 

x 2 H~ 1 

Find y(x ) that solves the differential equation y' H y = 8(x — 2) and 

x 

satisfies y( 1) = 1. 



2.40 Let / : R R be described by f(x) = ( x 2 — 1 ) 2 (H(x + 1) — H(x — 1)). 
Show that / belongs to the class C 1 but not to C 2 . Also compute its third 
derivative. 




3 

Laplace and Z transforms 



3.1 The Laplace transform 

Let / be a function defined on the interval R+ = [0, oo[. Alternatively, we 
can think of f(t) as being defined for all real £, but satisfying f(t) = 0 for 
allt < 0. This can be expressed by writing 

f(t) = f(t)H(t), 

where H is the Heaviside function. Now let s be a real (or complex, if you 
like) number. If the integral 

poo 

/(«)=/ f(t) e st dt (3.1) 

Jo 

exists (with a finite value), we say that it is the Laplace transform of f , 
evaluated at the point s. We shall write, interchangeably, f(s) or £[/](«). 
In applications, one also often uses the notation F(s) (capital letter for the 
transform of the corresponding lower-case letter). 

Example 3.1. Let f(t) = e at , t > 0. Then, 




provided ttiat a — s < 0 so that the evaluation at infinity yields zero. Thus 
we have f(s) = l/(s — a) for s > a, or 

£K t] (s ) = 



s — a 



s > a. 
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In particular, if a = 0, we have the Laplace transform of the constant 
function 1: it is equal to 1/s for s > 0. □ 



Example 3.2. Let f(t ) = t,t> 0. Then, integrating by parts, we get 

-stl 00 



m= f 

Jo 



te st dt 



t‘ 



e~ st ^ 1 r° 

~ s . t = o s Jo 



1 • e st dt 



= 0+i£[l](.) = i 



This works for s > 0. □ 

It may happen that the Laplace transform does not exist for any real 
value of s. Examples of this are given by f(t) = 1/t, f(t) = e l . 

A profound understanding of the workings of the Laplace transform re- 
quires considering it to be a so-called analytic function of a complex vari- 
able, but in most of this book we shall assume that the variable s is real. We 
shall, however, permit the function / to take complex values: it is practical 
to be allowed to work with functions such as f(t) = e iat . 

Furthermore, we shall assume that the integral (3.1) is not merely conver- 
gent, but that it actually converges absolutely. This enables us to estimate 
integrals, using the inequality | f f \ < f |/|. 

Example 3.3. Let f(t ) = e lht . Then we can imitate Example 3.1 above 
and write 




e {ib ~ s)t dt 



I" e (ib—s)t 



oo 



ib — s 



t = o 



1 

ib — s 



[e st (cos bt + i sin bt)] . 



For s > 0 the substitution as t — > oo will tend to zero, because the factor 
e~ st tends to zero and the rest of the expression is bounded. The result is 
thus that C[e lbt ](s) = 1 /(s — ib), which means that the formula that we 
proved in Example 3.1 holds true also when a is purely imaginary. It is 
left to the reader to check that the same formula holds if a is an arbitrary 
complex number and s > Re a. □ 

It would be convenient to have some simple set of conditions on a function 
/ that ensure that the Laplace transform is absolutely convergent for some 
value of s. Such a set of conditions is given in the following definition. 



Definition 3.1 Let k be a positive number. Assume that f has the follow- 
ing properties: 

(i) / is continuous on [0, oo[ except possibly for a finite number of jump 
discontinuities in every finite subinterval; 

(ii) there is a positive number M such that \f(t)\ < Me kt for all t > 0. 
Then we say that f belongs to the class £&. If f G £*; for some value of k, 
we say that f E £. 
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Using set notation we can say that £ = |^J £&. Condition (ii) means that 

fc> o 

/ grows at most exponentially; this word lies behind the use of the letter 
£. If / G £*; for one value of fc, then also / G £fc for all larger k. 

Theorem 3.1 If f G £&, then f(s) exists for all s > k. 



Proof. We begin by observing that condition (i) for the class £& implies 
that the integral 




exists finitely for all s and all T > 0. Now assume s > k. Thus there exists 
a number M and a number to so that f(t)e~ kt < M for t > to- Then we 
can estimate as follows: 

t \f(t)\e~ st dt= j \f{t)\e- kt e- {s ~ k)t dt< f dt 

J to Jto Jto 

poo poo 

<M e - {a ~ k)t dt<M e - {s ~ k)t dt = < oo. 

Jt 0 Jo s — k 

This means that the generalized integral over [to, oo[ converges absolutely, 
and then this is equally true for the integral over [0, oo[. □ 

The result of the theorem can be “bootstrapped” in the following way. 
If do = inf{fc : / G £&}, then the Laplace transform exists for all s > go - 
Indeed, let k = (s + (Jo)/2, so that do < k < s; then / G £& (why?), and 
the theorem can be applied. The number <Jo is a reasonably exact measure 
of the rate of growth of the function /. In what follows we shall sometimes 
use the notation &o or cro{f) for this measure. 

As a consequence of the theorem we now know that a large set of common 
functions do have Laplace transforms. Among them are, e.g., polynomials, 
trigonometric functions such as sin and cos and ordinary exponential func- 
tions; also sums and products of such functions. If you have studied simple 
differential equations you may recall that these functions are precisely the 
possible solutions of homogeneous linear differential equations with con- 
stant coefficients, such as, for example, 

y(v) + 4y (iv) _ 8y »> + 15y " _ 24 y' = 0. 



We shall soon see that Laplace transforms give us a new technique for solv- 
ing these equations. We shall also be able to solve more general problems, 
like integral equations of this kind: 



f f(t - x ) f{x) dx + 3 f f(x) dx + 2t = 0, t > 0. 
Jo Jo 



(3.2) 



Another consequence of the theorem is worth emphasizing: if a Laplace 
transform exists for one value of 8, then it is also defined for all larger 
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values of s. If we are dealing with several different transforms having various 
domains, we can always be sure that they are all defined at least in one 
common semi-infinite interval. It is customary to be rather sloppy about 
specifying the domains of definition for Laplace transforms: we make a tacit 
agreement that s is large enough so that all transforms occuring in a given 
situation are defined. 

Exercises 

3.1 Let f(t) = e t , g(t) = e~* 2 . Show that / ^ £, whereas g G £& for all k. 

3.2 Compute the Laplace transform of f(t) = e at , where a = a + i(3 is a 
complex constant. 

3.3 Let f(t) = sint for 0 < t < i r, f(t) = 0 otherwise. Find f(s). 



3.2 Operations 



The Laplace transformation obeys some simple rules of computation and 
also some less simple rules. The simplest ones are collected in the follow- 
ing table. Everywhere we assume that s takes sufficiently large values, as 
discussed at the end of the preceding section. 



1. C[af + /3g](s) = af(s) + (3g(s), if a and (3 are constants. 

2. C[e at f(t)](s) = f(s — a), if a is a constant (damping rule). 

3. If we define f(t) = 0 for t < 0 and if a > 0, then 

C[f(t - a)](s) = e~ as f(s) (delaying rule). 



4. C[f(at)](s) = -/(s/a), if a > 0. 
a 

The proofs of these rules are easy. As an example we give the computa- 
tions that yield rules 3 and 4: 



poo 

^[/(i — a)](s) = / — 

Jo 



-st 



u—t — a 
dt l du = dt 

t = u = — a 
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Example 3.4. Using rule 1 and the result of Example 3.3 in the preceding 
section, we can find the Laplace transforms of cos and sin: 



C[cosbt](s) = \C[e m + e -i6t ](s) - 1 



1 1 

+ 



2 \s — ib s -{-ib 

£[ S i„5,] (s ) = I £ [e* - = 5 ■ - jhi 



S 2 + 6 2 ’ 

_ b 
s 2 + b 2 ' 



□ 



Example 3.5. Applying rule 2 to the result of Example 3.4 we get 

s — a „ r x b 



£[e at cos bt](s) = 



( s — a ) 2 + b 2 ’ 



£[e at sin bt](s) = 



(s — a) 2 + b 2 * 



□ 

A couple of deeper rules are given in the following theorems. 

Theorem 3.2 If f E £fc 0 , then ( t tf(t)) € £&* for k\ > ko and 

C[tf(t)}(s) = 

Proof. We shall use a theorem on differentiation of integrals. In order to keep 
it lucid, we assume that / is continuous on the whole of R+; otherwise we 
would have to split into integrals over subintervals where / is continuous, 
and this introduces certain purely technical complications. Since / G £fc 0 , 
we know that \f(t)\ < Me kot for some number M and all sufficiently large 
t, say t> t\. Let S > 0. Then there is a t<i such that \t\ < e 5t for t > t<i. If 
t > to = max(£i, £ 2 ) we have 

|*/(*)| < e st • Me kot = Me {ko + s)t = Me kl \ 

which means that tf(t ) belongs to £fc x and has a Laplace transform. 

If we differentiate the formula f(s) = J 0 °° f(t) e~ st dt formally with re- 
spect to s, we get ( f)\s ) = f£°{—t)f(t) e~ st dt. According to the theorem 
concerning differentiation of integrals, this maneuver is permitted if we can 
find a “dominating” function g (that may depend on t but not on s) such 
that the integrand in the differentiated formula can be estimated by g for 
alH > 0 and all values of s that we consider, and which is such that / 0 °° g 
is convergent. Let a be a number greater than the constant k\ and put 
g(t) — | tf(t) e~ at \. For all s > a we have then \(—t) f(t) e~ st \ < g(t ), and 

nOO nOO p OO 

/ g(t) dt = / \tf(t)\e~ at dt < M / e klt • e~ at dt 
Jo Jo Jo 

POO 

= M / e -(°-*i)t dt = 

Jo 



< OO. 
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This shows that the conditions for differentiating formally are fulfilled, and 
the theorem is proved. □ 

Example 3.6. We know that £[l](s) = 1/s for s > 0. Then we can say 
that 

; = -(-?) = ?• s>0 - 
Repeating this argument (do it!) we find that 

= S >°- 

□ 



Example 3.7. Also, rule 2 allows us to conclude that 

n t 

£[t"e°‘](s)= »> 0 . 

□ 

A sort of reverse of Theorem 3.2 is the following. The notation /(0+) 

stands for the right-hand limit lim f (t) = lim f(t). 

t-> o+ t\o 

Theorem 3.3 Assume that f E £ is continuous on R+. Also assume that 
the derivative f f (t) exists for all t > 0 (with /'( 0) interpreted as the right- 
hand derivative) and that f E £. Then 

£[/'](s) = s/(s) -/( 0+). 

Proof. Suppose that / G £fc 0 and /' G £fci, and take s to be larger than 
both k 0 and fci. Let T be a positive number. Integration by parts gives 

[ T fit) e~ st dt = f(T) e~ sT - /(0+) e° + s [ T ft) e~ st dt. 

Jo Jo 

When T — )> oo, the first term in the right-hand member tends to zero, and 
the result is the desired formula. □ 

Theorem 3.3 will be used for solving differential equations. 

The following theorem states a few additional properties of the Laplace 
transform. 

Theorem 3.4 (a) If f G £, then 

lim f(s) = 0. (3.3) 

s — »oo 

(b) The initial value rule: If f( 0+) exists , then 

lim sf(s) = /( 0+). 

s— ► oo 



(3-4) 
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(c) The final value rule: If f(t) has a limit as t -» +oo, then 



lim s/(s) = /(+oo) = lim f(t). 

s\0-|- t— >oo 



(3.5) 



In applications, the rule (3.5) is useful for deciding the ultimate or 
“steady-state” behavior of a function or a signal. 

Proof, (a) Let e > 0 be given and choose 5 > 0 so small that 

f \f(t)\dt<s. 

Jo 

Let k > 0 be such that / E £& and let so > k. Then for s > so we get 

pS pOO 

\M< / \f(t)\e- st dt+ / \f{t)\e~ st dt 
Jo Js 

pS poo 

< / \f(t)\dt+ / |/(*)| e— dt 

Jo Js 

poo 

<e + e -( s ~ So)5 J \f(t)\e~ Sot dt<£ + Ce-( s ~ 3o)s = £ + Ce Ss ° ■ e 



-Ss 



The last term tends to zero^as s — ► oo and thus it is less than e if s is large 
enough. This proves that \ f(s)\ <2e for all sufficiently large s, and since e 
can be arbitrarily small, we have proved (3.3). 

(b) The idea of proof is similar to the preceding, e > 0 is arbitrary, but 
now we choose 6 > 0 so small that \f(t) — /(0+)| < e for 0 < t < S. With 
So as above we get, for s > sq, 



*/(*) 



pS pS p oo 

* / (/(*) - f{0+))e~ st dt + sf(0+) / e~ st dt + s / f(t) 
Jo Jo Js 



3~ st dt. 



The modulus of the first term is 

rS 



< se 



[ e~ st dt < se r 
Jo Jo 



-st 



dt = se • - = £, if s > 0. 



The second term can be computed: 



= 8f( 0+) 



1 — e 



—sS 



= /(0+)(l - e sS ) -4 / (0+) as s -¥ oo. 



Finally, the modulus of the third term can be estimated: 

poo poo 

< s / |/(t)|e- Sot e- (s - So)f dt < se~ sS ■ e s ° s / |/(t)| e~ sot dt = Cse~ Ss , 

Js Js 

which tends to zero as s — » oo. Just as in the proof of (3.3) we can draw 
the conclusion (3.4). 
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(c) This proof also runs along similar paths. We begin by writing 

T poo 

f(t) e~ st dt + s J ( f(t ) - /( oo))e~ st dt + f(oo)e~ sT . 

Choose T so large that \f(t) — /(oo)| < e for t > T. The modulus of the 

first term can be estimated by s / Q T |/| — >* 0 as s -* 0+, and the modulus 
of the second one is 

e • e~ st dt = £ e~ sT < s. 

The proof is finished in an analogous way to the others. □ 

We round off this section by a generalization of the rule for Laplace 
transformation of a power of t (cf. Example 3.6). To this end we need a 
generalization of factorials to non-integers. This is provided by Euler’s 
Gamma function , whis is defined by 






,x—l—u 



du , 



x > 0. 



It is easy to see that this integral converges for positive a:. It is also easy 
to see that T( 1) = 1. Integrating by parts we find 



r(* + i)= f 

JO 



u x e u du 



+ x 



POO 

/ ti *- 1 

Jo 



e u du = xT(x). 



From this we deduce that T(2) = 1 • T(l) = 1, T(3) = 2, and, by induction, 
T(n + 1) = n! for integral n. Thus, this function can be viewed as an 
interpolation of the factorial. 

Now we let f(t) = t a , where a > — 1. It is then clear that / has a Laplace 
transform, and we find, for s > 0, 




st = u \ _ [°° _ u du 
dt — du j s j J q \ s / s 



1 

5 a+i 




u a e u du — 



r(fl + l) 

s a + 1 



If a is an integer, this reduces to the formula of Example 3.6. 



Exercises 

3.4 Find the Laplace transforms of (a) 2 1 2 — e~ l 

(b) (t 2 + l) 2 (c) (sin t — cost) 2 (d) cosh 2 4t (e)e 2t sin3£ (f)t 3 sin3t. 

1/s for 0 < t < s, 

0 otherwise. 



3.5 Compute the Laplace transform of f(t) 



-{i 



3.6 Find the transform of f(t) = 



(t — l) 2 for t > 1, 
0 otherwise. 
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3.7 

3.8 

3.9 

3.10 



3.11 

3.12 

3.13 



Solve the same problem for f(t ) 



-f 1 - 



■ du. 



f 



Compute / te 6t sin tdt. (Hint: /(3) !) 



Find the Laplace transform of /, if we define f(t) = tsint for 0 < t < 7r, 
f(t ) = 0 otherwise. (Hint: use the result of Exercise 3.3, p. 42.) 

Find the Laplace transform of the function / defined by 



f(t) — na for n — 1 < x < n, 
Compute C\te~ l sin£](s). 



n = 1,2,3,.. 



Explain why the function 



s 2 + 1 



cannot be the Laplace transform of any 



/ € £. 

Show that if / is periodic with period a, then 



f(s) = 1 Ze- ' a, J o /(*)« Stdt 

(Hint: Let u = t — ak, use the formula for the sum of 

a geometric series.) 

3.14 Find the Laplace transform of the function with period 1 that is described 
by f(t) = t for 0 < t < 1. 

3.15 Verify the final value rule (3.5) for f(s) = 1 /(s(s + 1)) by comparing /(£) 
and lim sf(s). 

s—*0+ 

3.16 Prove that . What are the values of r(§) and r(§) ? 



3.3 Applications to differential equations 

Example 3.8. Let us try to solve the initial value problem 

y" - Ay f + 3y = t, t > 0; y{ 0) = 3, y f ( 0) = 2. (3.6) 

We assume that y = y(t) is a solution such that y , as well as y ! and y", has 
a Laplace transform. By Theorem 3.3 we have then 

£[y'](s) = sy - y(0) = sy - 3, 

£[y"]( s ) = s£[ 2 /'](s) - y'( 0) = s(sy- 3) - 2 = s 2 y - 3s - 2. 

Due to linearity, we can transform the left-hand side of the equation to get 
{ s 2 y — 3s — 2) — 4 (sy — 3) -F 3 y = ( s 2 — As + 3 )y — 3s + 10, 
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and this must be equal to the transform of the right-hand side, which is 
1/s 2 . The result is an algebraic equation, which we can solve for y: 



(s 2 — As + 3 )y — 3s + 10 



1 



y 



3 s 3 - 10s 2 + 1 3s 3 - 10s 2 + 1 



s 2 (s 2 - 4s + 3) s 2 (s - l)(s - 3) * 
The last expression can be expanded into partial fractions. Assume that 



3s 3 - 10s 2 + 1 A BCD 

s 2 (s — l)(s — 3) s 2+ s + s-l + s-3* 



Multiplying by the common denominator and identifying coefficients we 
find that A=|,JB=|,C = 3, and D = — Thus we have 





1 

s — 1 



4 J_ 

9 s — 3 



It so happens that there exists a function with precisely this Laplace trans- 
form, namely, the function 



\t + ^ + 3e* 



4 3t 
9 e ‘ 



Could it be the case that y = z ? One way of finding this out is by differ- 
entiating and investigating if indeed z does satisfy the equation and initial 
conditions. And it does (check for yourself)! By the general theory of dif- 
ferential equations, the problem (3.6) has a unique solution, and it follows 
that z must be the solution we are looking for. □ 

The example demonstrates a very useful method for treating linear in- 
tit ial value problems. There is one difficulty that is revealed at the end of 
the example: could it be possible that two different functions might have 
the same Laplace transform? This question is answered by the following 
theorem. 



Theorem 3.5 (Uniqueness for Laplace transforms) If f and g both 
belong to E, and f(s ) = g(s) for all (sufficiently) large values of s, then 
f(t) = gift) for all values of t where f and g are continuous. 

We omit the proof of this at this point. It is given in Sec. 7.10. In that 
section we also prove a formula for the reconstruction of f(t) when f(s) 
is known — a so-called inversion formula for the Laplace transform. The 
present theorem, however, gives us the possibility to invert Laplace trans- 
forms by recognizing functions, just as we did in the example. 

This requires that we have access to a table of Laplace transforms of 
such functions that can be expected to occur. Such a table is found at the 
end of the book (p. 247 ff), and similar tables are included in all decent 
handbooks on the subject. Several of the entries in such tables have already 
been proved in the examples of this chapter; others can be done as exercises 
by the interested student. 
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We point out that the uniqueness result as such does not rule out the 
possibility that a differential equation (or other problem) may have solu- 
tions that have no Laplace transforms, e.g., solutions that grow faster than 
exponentially. To preclude such solutions one must look into the theory 
of differential equations. For linear equations there is a result on unique 
solutions for initial value problems, which may serve the purpose. If the 
coefficients are constants and the equation is homogeneous, one actually 
knows that all solutions have at most exponential growth. 

The Laplace transform method is ideally adapted to solving initial value 
problems. Strictly speaking, the method takes into consideration only what 
goes on for t > 0. Very often, however, the expressions obtained for the 
solutions are also valid for t < 0. 

We include some examples on using a table of Laplace transforms in a 
few more complicated situations. The technique may remind the reader of 
the integration of rational functions. 

- 2s + 3 

Example 3.9. Find /(£), when f(s) = — . 

^ J w ’ J w s 2 4- 4s + 13 

Solution. Complete the square in the denominator: s 2 +4s+13 = (s+2) 2 +9. 
Then split the numerator to enable us to recognize transforms of cosines 
and sines: 

2s + 3 2(s + 2) — 1 o s + 2 i 3 

s 2 + 4s + 13 = (s + 2) 2 + 3 2 = ’ (s + 2) 2 + 3 2 “ 5 ‘ (s + 2) 2 + 3 2 ’ 



and now we can see that this is the transform of f(t) = 2e 2t cos3 1 — 
le _2t sin3 1. □ 

Example 3.10. Find g(t), if g(s) = ^ , 2 . 

I 5 + ij 

Solution. We recognize the transform as a derivative: 



9(a) = ~ 



d_ 1 
ds s 2 + 1 ’ 



By Theorem 3.2 and the known transform of the sine we get g(t) = tsint. 

□ 



Example 3.11. Solve the initial value problem 

y ,, + 4i/ , + 13 2 / = 13, y(0) = i/(0) = 0. 



Solution. Transformation gives 

(s 2 + 4s + 13 )y = — 
s 



13 

s((s + 2) 2 + 9) * 
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Expand into partial fractions: 

^ 1 s + 4 1 5 + 2 2 3 

y= 8 ~ (s + 2)2 + 9 = s ~ (s + 2)2 + 9 “ 5 ' (s + 2)2 + 9 ' 

The solution is found to be 

y(t) = (l - e _2t (cos3 1 + | sin 3 t))H(t). 

(Here we have multiplied the result by a Heaviside factor, to indicate that 
we are considering the solution only for t > 0. This factor is often omitted. 
Whether or not it should be there is often a matter of dispute among users 
of the transform.) □ 

We can also treat systems of differential equations. 

Example 3.12. Solve the initial value problem 



x' = x + 3 y, 
y' = 3x + y- 



*(0) = 5, 2/(0) = 1. 



Solution. Laplace transformation gives 

{ sx — 5 = x + 3y i (1 — s)x + 3y= -5 

sy - 1 = 3x + y y 3x + (1 - s)y= -1 

We can, for example, solve the second equation for x = ^(s — 1 )y — | and 
substitute this into the first, whereupon simplification yields (s 2 —2s—S)y = 
s + 14 and 

5 + 14 3 2 

V ~ (s-4)(s + 2) “ s-4 s + 2' 

We see that y = 3e 4t — 2e _2t , and then we deduce, in one way or another, 
that x = 3e 4t + 2e _2t . (Think of at least three different ways of performing 
this last step!) □ 

Finally, we demonstrate how even a partial differential equation can be 
treated by Laplace transforms. The trick is to transform with respect to one 
of the independent variables and let the others stand. Using this technique 
often involves taking rather bold chances in the hope that rules of compu- 
tation be valid. One way of regarding this is to view it precisely as taking 
chances - if we arrive at a tentative solution, it can always be checked by 
substitution in the original problem. 

Example 3.13. Find a solution of the problem 



d 2 u 
dx 2 



du 

W 



0 < x < 1, t > 0; 



-u(0, t) = 1, u(l,t) = 1, t > 0; 
u(x, 0) = 1 + sin 7rx, 0 < x < 1. 
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Solution. We introduce the Laplace transform U(x,s) of u(x,t ), i.e., 

poo 

U (x, s) = C[t ^ u(x, t )] (s) = / u(x, t ) e~ st dt. 

Jo 

Here, x is thought of as a constant. Then we change our attitude and 
assume that this integral can be differentiated with respect to x, indeed 
twice, so that 



02 rr o2 poo poo o2 



The differential equation is then transformed into 



= sU — (1 + sin7rx), 0 < x < 1, 



and the boundary conditions into 



U(0,s) 



U(l,s) = ~. 

S 



Now we switch attitudes again: think of s as a constant and solve the 
boundary value problem. Just to feel comfortable we could write the equa- 
tion as 

U" — sU = —1 — sin7rx. (3.7) 

The homogeneous equation has a characteristic equation r 2 — s = 0 and 
its solution is Up = Ae Xy ^ + Be~ Xy/ (Here, the “constants” A and B 
are in general functions of s .) A particular solution to the inhomogeneous 
equation could have the form Up = a + 6 sin7rx + ccos7rx, and insertion 
and identification gives a = 1/s, b = l/(s + 7 r 2 ), c = 0. Thus the general 
solution of (3.7) is 

U(x, s) = A(s)e x ^ s + B{s)e~ xVS + - + 



The boundary conditions force us to take A(s) = B(s) = 0, so we are left 
1 sin 7 xx 

with U(x,s) = — I r. Now we again consider x as a constant and 

s s + 7T Z 

recognize that U is the Laplace transform of u(x,t) = 1 + e _7r t sin7rx. 
The fact that this function really does solve the original problem must be 
checked directly (since we have made an assumption on differentiability of 
an integral, which might have been too bold). □ 



Remark. This problem can also be attacked by other methods developed in later 
parts of the book (Chapter 6). □ 
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Exercises 

1 3 

3.17 Invert the following Laplace transforms: (a) ^ ^ (b) ^ — 

15 1 1 

(c ^ s(s + 2) 2 ^ s 2 (s — 5) 2 (e) (s - a)(s - b) s 2 + 4s + 29' 

3.18 Use partial fractions to find / when f(s ) is given by 

(a) s~ 2 (s + 1) _1 , (b) b 2 s~ 1 (s 2 + 6 2 ) -1 , (c) s(s - 3) -5 , 

(d) ( s 2 + 2)s _1 (s + l) _1 (s + 2) _1 . 



3.19 Invert the following Laplace transforms: (a) (b) -r-~ 

s (s — lj(s 

(c)ln rf2 (d)ln 4rn (e) i« (t) ^- 

3.20 Solve the initial value problem y" + y — 2e l , t > 0, y{ 0) = y'( 0) = 2. 

oo, o_,._ .U. u,._ iy"(t)-2y , (t) + y(t) = te t sint, 



3.21 Solve the initial value problem 



3/(0) = 0, 3/(0) = 0. 



3 22 Solve { ~ V + 4y ^ ~ 4y ® = _3e * + 4e2 *’ 

V 1 2/(0) = 0, 3/(0) = 5, y"(0) = 3. 

f z'(t) +j/'(t) = t, 

3.23 Solve the system < x"(t ) — y(t) = e _t , 

l®(0) = 3, x'(0) = -2, y(0) = 0. 

{ x f (t) - y f (t) - 2 x(t) + 2 y(t) = sint, 
x"(t) + 2y'(t) + x(t) = 0, 
x(0) = x'(0) = y(0) = 0. 

3.25 Solve the problem 

3 /"(*) - 3y'(t) + 2y(t) = { q t < 2 ’ = V = °' 



3.26 Solve the system 



= 2z — 2y -f e 



= y-3z 



t> 0; 3/(0) = 1, z(0) = 2. 



3.27 Solve the differential equation 

23 / 1V ^ + y" - y” - y - y = t + 2, t > 0, 

with initial conditions y( 0) = y'(0) = 0, y"(0) = y f "(0) = 1. 

3.28 Solve the differential equation 



y" + 3y' + 2y = e * sint, t > 0; 2/(0) = 1, y'(0) = -3. 
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3.4 Convolution 



In control theory, for example, one studies the effect on an incoming signal 
by a “black box” that transforms it into an “outsignal” : 



insignal 



black 

box 



outsignal ^ 



Let the insignal be represented by the function t x(t), t > 0, and the 
outsignal by t 1 -» y(£), t > 0. We assume that the system has four important 
properties: 



(a) it is linear , which means that a linear combination of inputs results in 

the corresponding linear combination of outputs; 

(b) it is translation invariant , which means, loosely, that the black box 

operates in the same way at all points in time; 

(c) it is continuous in the sense that “small” changes in the input generate 

“small” changes in the output (which should be formulated more 
precisely when necessary); 

(d) it is causal , i.e., the outsignal at a certain moment t does not depend 

on the insignal at moments later than t. 



It can then be shown (see Appendix A) that there exists a function 
t g(t ), t > 0, such that 



x(u)g(t — u) du = / x(t — u)g(u) du. (3.8) 

Jo 

The function g can be said to contain all information about the system. 

The formula (3.8) is an example of a notion called the convolution of 
the two functions x and g. (We shall encounter other versions of convolu- 
tion in other parts of this book.) We shall now study this notion from a 
mathematical point of view. 

Thus, we assume that / and g are two functions, both belonging to £. 
The convolution f * g is a new function defined by the formula 




(/* 



9)(t) = f * g(t) = [ ft 
Jo 



u ) g(t — u) du , 



t > 0. 



It is not hard to see that this function is continuous on [0, oo[, and it might 
possibly belong to £. Indeed, it is not very difficult to show directly that 
if / 6 £fci and g E £fc 2 > then / * g E £& for all k > max(fci, A^). (See 
Exercise 3.38.) Using the notation do (/), introduced after Theorem 3.1, we 
could express this as do(/ * g) < max(d 0 (/), d 0 (<?)).) 
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Convolution can be regarded as an operation for functions, a sort of 
“multiplication.” For this operation a few simple rules hold; the reader is 
invited to check them out: 



f*9=9*f 
f *(g*h) = (f * g)*h 
f*(g + h)=f*g + f*h 



(commutative law) 
(associative law) 
(distributive law) 



Example 3.14. Let fit) = e*, g(t ) = e 2t . Then 

f*g{t)= f e u e~ 2 ^- u Uu= f e u ~ 2t+2u du = e~ 2t f e* u du 

Jo Jo Jo 



= e 



-2 1 n „3u1 u=t _ 1 -2t( 3t 



[U 3u ] 

Lo Ju=l 



0 



(e 3t - 1) = 



e 4 - e~ 2t 



□ 



Example 3.15. If g(t) = 1, then f*g(t) = f(u) du. Thus, “integration” 
can be considered to be convolution with the function 1. □ 

When dealing with convolutions, the Laplace transform is useful because 
of the following theorem. 



Theorem 3.6 The Laplace transform of a convolution is the product of 
the Laplace transforms of the two convolution factors: 

£[/*s](s) = f(s)g(s). 



Proof. Let s be so large that both f(s) and g(s) exist. We have agreed 
in section 3.1 that this means that the corresponding integrals converge 
absolutely. Now consider the improper double integral 

JJ \f(u)9(v)\e- s{u+v) dudv, 

Q 



where Q is the first quadrant in the uv plane. The integrated function being 
positive, the integral can be calculated just as we choose. For example, we 
can write 



/* p poo p o 

Jj \f(u)g(v)\e~ s ( u+v ^ dudv = J du j 



\m\g(v)\e 




The two one-dimensional integrals here are assumed to be convergent, 
which means that the double integral also converges. But this in turn means 
that the improper double integral without modulus signs, 



$( 5 ) = JJ f(u)g(v)e dudv 
Q 
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is absolutely convergent. It can then also be computed in any manner, and 
we do it in two ways. One way is imitating the previous calculation: 



poo poo 

<b(s)= / du f(u)g(v)e~ su e~ sv dv 
Jo Jo 

/»oo 



poo poo ~ 

= / f{u)e~ su du / g{v)e~ sv dv = f(s) g(s). 

Jo Jo 

Another way is integrating on triangles Dt : u > 0, v > 0, u + v < T. But 

J f * g(t)e~ st dt = J f {u)g(t — u) duj e~ su dt 

— f dt f f(u)e~ su g{t — u)e~ s ^~ u ^du 
Jo Jo 

= J o f{u)e~ su du ^ g(t - dt = { * = * } 

pT pT—u 

= / f(u)e~ su du / g(v)e~ sv dv 
Jo Jo 

= JJ f(u)g(v)e~ su e~ sv dudv — > $(s) 



Dt 



as T —>>oo. This proves the formula in the theorem. 



□' 



Example 3.16. As an illustration of the theorem we can take the situation 
in Example 3.14. There we have 



/M = 7TT- s(«) = 7f2. 



f(s)g(s) = 



1 



I 1 

3 3 



(s — l)(s + 2) s — 1 5 + 2 



= £[f*g]{s)- 



□ 



Example 3.17. Find a function / that satisfies the integral equation 

poo 

f(t) — 1 + / f(t — u) sinudu, t> 0. 

Jo 

Solution. Suppose that / 6 £. Then we can transform the equation to get 



from which we solve 

/(«) 



f(s) = I + /(*) • r, 

5 S z + 1 



s 2 + 1 1 _ s 2 + 1 _ 1 1_ 

S 2 S S 3 S + S 3 
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and we see that /(£) = 1 + \t 2 ought to be a solution. Indeed it is, be- 
cause this function belongs to £, and then our successive steps make up a 
sequence of equivalent statements. (It is also possible to check the solution 
by substitution in the given integral equation. This should be done, if time 
permits.) □ 



Exercises 

3.29 Calculate directly the convolution of e at and e bt (consider separately the 
cases a t and a = b). Check the result by taking Laplace transforms. 

3.30 Use the convolution formula to determine / if /(s) is given by 

(a) s -1 (s + l) -1 , (b) s~ 1 (s 2 + a 2 ) -1 . 

0 2 

3.31 Find a function with the Laplace transform 



(s 2 + l) 2 ‘ 



3.32 Find a function / such that 



rx 

/ e~ v cos y j 

Jo 



if(x-y)dy = x 2 e x , x>0. 

3.33 Find a solution of the integral equation 



/<*- 



u) 2 f (u) du = t 3 , t > 0. 



3.34 Find two solutions of the integral equation (3.2) on page 41. 

3.35 Find a function y(t) that satisfies y( 0) = 0 and 

2 f (t — u) 2 y(u) du + y (t) = (t — l) 2 for t > 0. 

Jo 

3.36 Find a function f(t) for t > 0, that satisfies 

/( o) = 1, f'(t) + 3 f(t) + f(u)e u ~* du = ( 

3.37 Find a solution / of the integral-differential equation 

'* [ e v cos 2 (t - y) f(y) dy = f'(t) + f(t) - e~\ /( 0) = 0. 

Jo 



0 < t < 2, 
t> 2 



5e" 



3.38 Prove the following result: if / E £fc x and g G £fc 2 > fhen / * g G £fe for all 
k > max{A;i, fe}. 
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3.5 *Laplace transforms of distributions 

Laplace transforms can be used in the study of physical phenomena that 
take place in a time interval that starts at a certain moment, at which the 
clock is set to t = 0. It is possible to allow the functions to include instan- 
taneous pulses and even more far-reaching generalizations of the classical 
notion of a function - i.e., to allow so-called distributions into the game. 
When we do so, it will normally be a good thing to allow such things to 
happen also at the very moment t = 0, so we modify slightly the definition 
of the Laplace transform into the following formula: 




If / is an ordinary function, the modified definition agrees with the former 
one. But if / is a distribution, something new may occur. 

As an example, let 5 a (t) be the Dirac pulse at the point a, where a > 0. 
Then 

POO 

S a (s) = / 6 a (t)e- st dt = e~ as . 

Jo- 
in particular, if a = 0, we get <S(s) = 1. We see that the rule that a Laplace 
transform must tend to zero as s — )* oo no longer need hold for transforms 
of distributions. 

The formula for the transform of a derivative must also be slightly mod- 
ified. Indeed, integration by parts gives 

_ r°° too r°° _ 

/'(«) = / f{t)e st dt = f(t)e st +s f(t)e~ st dt = sf{s)-f( 0-), 

where /( 0— ) is the left-hand limit of f(t) at 0. This may cause some confu- 
sion when dealing with functions that are considered to be zero for negative 
t but nonzero for positive t. In this case it may now happen that /' includes 
a multiple of 5, which explains the different appearance of the formula. In 
this situation, it is preferable to be very explicit in supplying the factor 
H (t) in the description of functions. 

Example 3.18. Solve the initial value problem 

y" + V + 13 y = S'(t ), y( 0-) = y f ( 0-) = 0. 
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The solution is found to be 

y(t) = e~ 2t (cos3 1 — | sin 3 t)H(t). 

We check it by differentiating: 

y f (t) = e“ 2t (— 2cos3 t + | sin 3 t — 3sin3£ — 2 cos 3 t)H(t) + 8(t) 

= e _2t (— 4cos3 t — | sin 3 t)H(t) + S(t), 

y"(t) = e _2t (8cos3 1 + ^ sin3t + 12sin3£ — 5 cos 3t)H(t) — 4 8(t) + 8 f (t) 
= e" 2t (3cos3 1 + f sin 3 t)H(t) - 4 S(t) + 5'(t). 

Substituting this into the left-hand member of the equation, one sees that 
it indeed solves the problem. □ 



Example 3.19. Find the general solution of the differential equation 
y" + 3 y' + 2 y = 6. 

Solution. It should be wellknown that the solution can be written as the 
sum of the general solution yn of the corresponding homogeneous equation 
y" + 3 y' + 2y = 0, and one particular solution yp of the given equation. We 
easily find yp = C\e~ l + C^e -2 *, and proceed to look for yp. In doing this 
we assume that yp(0—) = y' P ( 0— ) = 0, which gives the simplest Laplace 
transforms. Indeed, y f p = syp and y p = s 2 yp , so that 



S V + 3 W + 2i,p = l <=*■ VP= {s + l ' (s + 2) 
Thus it turns out that 



1 1 
S “b 1 S “b 2 



y P = (e-‘ - e~ 2t )H(t). 

This means that the solution of the given problem is 

y = C x e + C 2 e~ 2t + (e" 4 - (i) 

= (C'i+H(f))e- t + {C 2 -H{t))e~ 2t 

f C\e 4 + C 2 e 24 , t < 0, 

( C\ + l)e 4 + ( C 2 — l)e 24 , t > 0. 

We can see that in each of the intervals t < 0 and t > 0 these expressions 
are solutions of the homogeneous equation, which is in accordance with 
the fact that 5 = 0 in the intervals. What happens at t = 0 is that the 
constants change value in such a way that the first derivative has a jump 
discontinuity and the second derivative contains a 8 pulse (draw pictures!). 

□ 

The particular solution yp found in the preceding problem is called a 
fundamental solution of the equation. Let us now denote it by E; thus, 

E{t) = (e -4 - e~ 2t )H(t). 




3.5 *Laplace transforms of distributions 59 



It is useful in the following situation. Let / be any function, continuous for 
t> 0. We want to find a solution of the problem y" + 3y' + 2y = /. If we 
assume y( 0— ) = y'( 0— ) = 0, we get 



y = 



m 



s 2 + 3s + 2 



= /(*) 



s 2 + 3s + 2 



f(s)E(s). 



This means that y can be found as the convolution of / and E: 



y (t) = f*E{t) = [ f(t — u ) (e 
JO 



-2 u 



) du. 



The fundamental solution thus provides a means for finding a particular 
solution for any inhomogeneuous equation with the given left-hand side. 

This idea can be applied to any linear differential equation with constant 
coefficients. The left-hand member of such an equation can be written in 
the form P(D)y , where D is the differentiation operator and P(-) is a 
polynomial. For example, if P(r) = r 2 + 3r + 2, then 



P(D)y = ( D 2 + 3 D + 2 )y - y" + 3 y’ + 2 y. 



The fundamental solution E is, in the general case, the function such that 
£(s) = -pj^, E(t) = 0iovt<0. 



Exercises 

3.39 Find a solution of the differential equation y ,n + 3 y" -f 3 y' + y = H(t — 1) -f 
6(t — 2), that satisfies y( 0) = y'( 0) = y"( 0) = 0. 

3.40 Solve the differential equation y" + Ay' + by = 5(t), y(t) = 0 for t < 0. Then 
deduce a formula for a particular solution of the equation y" + 4 y' + 5 y = 
f(t) 1 where / is any continuous function such that f(t) = 0 for t < 0. 

3.41 Find fundamental solutions for the following equations: (a) y n + 4 y = 8, 

(b) y" + 4 y f + 8 y = 5, (c) y ,n + 3 y" + 3 y' + y = 6. 

3.42 Find a function y such that y(t) = 0 for t < 0 and 



y'(t) + 




y(u)du = 2 (H(t — 1) — H(t — 2)) for t > 0. 



3.43 Find a function f(t) such that f(t) = 0 for t < 0 and 




e p dp - f(t) + /'(£) = S(t) - te 1 H(t), 



e 



— OO <t< 00. 
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3.6 The Z transform 

In this section we sketch the theory of a discrete analogue of the Laplace 
transform. We have so far been considering functions 1 1 ->> /(£), where t is a 
real variable (mostly thought of as representing time). Now, we shall think 
of t as a variable that only assumes the values 0, 1, 2, ... , i.e., non-negative 
integer values. In applications, this is sometimes more realistic than con- 
sidering a continuous variable; it corresponds to taking measurements at 
equidistant points in time. 

A function of an integer variable is mostly written as a sequence of num- 
bers. This will be the way we do it, at least at the beginning of the section. 

Let {a n }“ = o be a sequence of numbers. We form the infinite series 



A w = % = E anZ ” • 



If the series is convergent for some z, then it converges absolutely outside of 
some circle in the complex plane. More precisely, the domain of convergence 
is a set of the type \z\ > <r, where 0 < a < oo. (It may also happen that 
the series converges at certain points on the circle \z\ = a, but this is 
rarely of any importance.) Power series of this kind, that may encompass 
both positive and negative powers of z, are called Laurent series. (A 
particular case is Taylor series that do not contain any negative powers of z; 
in the present situation we are considering a reversed case, with no positive 
powers.) A necessary and sufficient condition for the series to converge at 
all is that there exist constants M and R such that \a n \ < MR n for all 
n. This condition is analogous to the condition of exponential growth for 
functions to have a Laplace transform. 

The function A(z) is called the Z transform of the sequence {a n }^ = 0 . 
It can be employed to solve certain problems concerning sequences, in a 
manner that is largely analogous to the way that Laplace transforms can 
be used for solving problems for ordinary functions. Important applications 
occur in the theory of electronics, systems engineering, and automatic con- 
trol. 

When working with the Z transformation, one should be familiar with 
the geometric series. Recall that this is the series 



5 >“’ 

n = 0 



where w is a real or complex number. It is convergent precisely if \w\ < 1, 
and its sum is then 1/(1 — w). This fact is used “in both directions,” as the 
following example shows. 
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Example 3.20. If a n = 1 for all n > 0, the Z transform is 

z 

’ 

n= 0 ~ n=0 ~ 1 ~ 

Z 

which is convergent for all z such that \z\ > 1. On the other hand, if A is a 
nonzero complex number, we can rewrite the function B(z) = z/(z — A) in 
this way: 




B(z) 



z — A 



A 

z 



oo 

U 

71=0 



Ay 



\ n 

M>I4 

71=0 



which shows that B(z) is the transform of the sequence b n = A n (n > 0). 
(Here we actually use the fact that Laurent expansions are unique, which 
implies that two different sequences cannot have the same transform.) □ 

We next present a simple, but typical, problem where the transform can 
be used. 



Example 3.21. If we know that ao = 1, a\ = 2 and 



&n +2 — 3n n _|_ i 2a n , n — 0, 1, 2, ... , (3-9) 

find a formula for a n . 

An equation of the type (3.9) is often called a difference equation . In many 
respects, it is analogous to a differential equation: if differential equations 
are used for the description of processes taking place in “continuous time,” 
difference equations can do the corresponding thing in “discrete time.” 

To solve the problem in Example 3.21, we multiply the formula (3.9) by 
z~ n and add up for n = 0, 1, 2, . . .: 

oo oo oo 

^2 a n+ 2 Z~ n = 3 ^2 a-n+iz~ n ~ 2 ^ a nZ~ n ■ (3.10) 

71=0 71=0 71=0 

Now we introduce the Z transform of the sequence {a n }^L 0 : 

OO 0 

A ( z ) = ^2 anZ ~ n = 1+ ~ z + + + ( 3 - n ) 
71=0 

We notice that, firstly, 

oo oo / oo \ / oo \ 

\ 2 a n+lZ~ n = ^2 a kZ~ ( ~ k ~ 1 ' ) = 2 ( ^2 akZ ~ k ) = z ( ^2 ttnZ ~ n - ) 

71=0 k= 1 \/c = l / \71=0 / 

= z{A(z)-l), 



and, secondly, 
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'Y^a n +2Z n = '*Ta k z (k 2) = 



71=0 



k—2 



= Z 




Vn=0 



I-? 

Z 



Thus, the equation (3.10) can be written as 

A(z) - 1 “ “ 3 z(A(z) — 1) - 2 A(z), 

from which A(z) can be solved. After simplification we have 

We saw in the preceding example that this is the Z transform of the se- 
quence 

a n = 2 n , n = 0,1,2, .... 

We can check the result by returning to the statement of the problem: 
ao = 1 and a\ = 2 are all right; and if a n = 2 n and a n+ i = 2 n+1 , then 

3a n+ i - 2a n = 3 • 2 n+1 - 2 • 2 n = 3 • 2 n+1 - 2 n+1 = 2 • 2 n+1 = 2 n+2 , 

which is also right. □ 

In the example, it is obvious from the beginning that the solution is 
unique. If ao and a\ are given, the formula (3.9) produces the subsequent 
values of the a n in an unequivocal way. In general, problems about number 
sequences are often uniquely determined in the same manner. However, 
just as for the Laplace transform, the Z transform cannot be expected to 
give solutions if these are very fast-growing sequences. 

We take a closer look at the correspondence between sequences {a n }^L 0 
and their Z transforms A(z). In order to have an efficient notation we write 
a = {a n }^L 0 and A = Z[a}. Thus, Z denotes a mapping from (a subset of) 
the set of number sequences to the set of Laurent series convergent outside 
of some circle. 

Example 3.22. We have already seen that if a = {A n }o°, then 



Z[a](z) = J2 A"*”" = 



71=0 



A’ 



M> |A|- 



□ 

OQ - n 

Example 3.23. If a = {l/n!}o°, then Z[a](z) = ^ = e 1 / 2 , \z\ > 0. 
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Example 3.24. The sequence a = {n!}o° has no Z transform, because 

oo 

the series ^ n\ z~ n diverges for all z. □ 

n— 0 

As stated at the beginning of this section, a sufficient (and actually nec- 
essary) condition for A(z) to exist is that the numbers a n grow at most 
exponentially: \a n \ < MR n for some numbers M and R. It is easy to 
see that this condition implies the convergence of the series for all z with 
\z\ > R . 

Some computational rules for the transformation Z have been collected in 
the following theorem. In the interest of brevity we introduce some notation 
for operations on number sequences (which can be viewed as functions N — >> 
C). If we let a = {a n }£L 0 and b = {b n }™ =0 , we write a + & = {a n + b n }$JL 0 ; 
and if furthermore A is a complex number, we put Aa = {Aa n }J£L 0 * We also 
agree to write 

A = Z[a], B = Z[b\. 

The “radius of convergence” of the Z transform of a is denoted by cr a : 
this means that the series is convergent for \z\ > o a (and divergent for 
\z\ < (T a )- 

Theorem 3.7 (i) The transformation Z is linear , i.e., 

Z[Xa\(z) = XZ[a\(z), \z\ > <r a , 

Z[a + b](z) = Z[a](z) + Z[b](z), \z\ > max(cr a ,cr b ). 

(ii) If X is a complex number and b n = A n a n , n — 0, 1, 2, . . ., then 

B(z) = A(z/ A), |z| > Xg a . 

(iii) If k is a fixed integer > 0 and b n = a n +k , n = 0, 1, 2, . . then 

B(z ) = z k (a(*) - a o - ^ ^5r) 

= z k A(z) — aoZ k — a\z k ~ x a^-\z, \z\ > cr a . 

(iv) Conversely, if k is a positive integer and b n = a n -k for n > k and 
b n = 0 for n < k, then B(z) = z~ k A(z). 

(v) If b n = na n , n = 0, 1, 2, . . then 

B(z) = -zA'(z ), \z\ > a a - 

Proof The assertions follow rather immediately from the definitions. We 
saw a couple of cases of (iii) in Example 3.21 above. We content ourselves 
by sketching the proofs of (ii) and (v). For (ii) we find 

oo oo oo / \ —n 

B(z) = £ b n z- n = £ \ n a n z~ n = Y, a 4j) = 

71=0 71=0 71=0 ^ 
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And as for (v), the right-hand side is 

, oo oo oo 

-2-— a n z ~ n = = ^2 na nZ 

n — 0 n=0 n — 0 



left-hand side. 

□ 



Example 3.25. Example 3.23 and rule (ii) give us the transform of the 
sequence {A n /n!}o°, viz., 



A (z) = e 1/( * /A) = e x > z . 



□ 

When solving problems concerning the Z transform, you should have a 
table at hand, containing rules of computation as well as actual transforms. 
Such a table is included at the end of this book (p. 250). 

Example 3.26. Find a formula for the so-called Fibonacci numbers, 
which are defined by f 0 = f 1 = 1, / n+2 = /„+ 1 + f n for n > 0. 

Solution. Let F = Z[f], If we Z-transform the recursion formula, using (iii) 
from the theorem, we get 

z 2 F{z) -z 2 -z = (zF(z) -z) + F(z), 



whence (z 2 — z — l)F(z) = z 2 and 



F(z) 



= z 



z 

z 2 -z-l' 



In order to recover f n , a good idea would be to expand into partial fractions, 
in the hope that simple expressions could be looked up in the table on 
page 250. A closer look at this table reveals, however, that it would be a 
good thing to have a z in the numerator of the partial fractions, instead 
of just a constant. Thus, here we have peeled off a factor z from F(z) and 
proceed to expand the remaining expression: 



F(z) z _ A B 

z z 2 — z — 1 2 — a + z — (3* 



where 



i + v$ 






i-VE 



= ^5 + 1 

2\/5 



VS — i 
2VE 



F(z) = 



Az Bz 
z — OL + z — (3 



This gives 
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and from the table we conclude that 



/« = Aa n + B(3 n — 



Vb + in + Vb 
2v^ V 2 



' 2\/5 



l-\/5 

2 



This can be rewritten as 









n+1 






n-f 1 



n = 0, 1, 2, — 



(In spite of all the appearances of y/h in the expression, it is an integer for 
all n > 0.) □ 

As you can see in this example, the method of expanding rational func- 
tions into partial fractions can be useful in dealing with Z transforms, 
provided one starts out by securing an extra factor z to be reintroduced in 
the numerators after the expansion. 

If a and b are two number sequences, we can form a third sequence, c, 
called the convolution of a and 6, by writing 

n n 

C n = ^ ^ Q"n—kbk = ^ tikbri—ki n = 0 , 1 , 2, ... . 



k = 0 



k = 0 



One writes c = a * 6, and we also permit ourselves to write things like 
c n = {a * b) n . We determine the Z transform C = Z[c]: 



C(z) = E E a n-kh Z~ n = E On-fc&fc 

n=0 k= 0 k=0n=k 

oo oo oo oo 

= EE a n-kZ~ {n ~ k) b k Z~ k = b k Z~ k ^2 a n-kZ~ 
k= 0 n=k k = 0 

oo oo 

= E bkZ ~ k E a ^ z ~ m = A (z)B(z). 



( n—k ) 



n=k 



k = 0 



771=0 



The manipulations of the double series are permitted for |z| > max(cr a , cq>), 
because in that region everything converges absolutely. 

This notion of convolution appears in, e.g., control theory, if a system is 
considered in discrete time (see Appendix A). 

Example 3.27. Find x(t ), t = 0, 1, 2 , . . ., from the equation 



y^3 k x(t — k) = 2 *, t = 0,1,2, 

k = 0 
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Solution. The left-hand side is the convolution of x and the function t i-4 
(1/3)*, so that taking Z transforms of both members gives 

z 



z ~t 



r -x{z) 



Z — 



1 * 



(We have used the result of Example 3.22.) We get 



*) = 



_3 _ 



1 



~ _ 1 ~ 1 3 

^2^2 ^ 2 



1 ’ 



and, using Example 3.22 and rule (iv) of Theorem 3.7, we see that 



J 1 for t = 0, 

X(t> = { ©‘ ~ 5 ' ( I )’" 1 

The final expression can be rewritten as 

^)=(i-D-(ir i =^2^=i-2- *>i. 

□ 

In a final example, we indicate a way of viewing the Z transform as 
a particular case of the Laplace transform. Here we use translates of the 
Dirac delta “function,” as in Sec. 3.5. 

Example 3.28. Let {a n }£L 0 be a sequence having a Z transform A(z), 
and define a function / by 

oo oo 

/(*) = X! a n S n(t) = XI ~ «)• 

n = 0 n = 0 

The convergence of this series is no problem, because for any particular t 
at most one of the terms is different from zero. Its Laplace transform must 
be 

OO pOO 00 00 

7{s) = '%2 e~ st a n S(t-n)dt^^2a n e~ ns = ^2a n (e s ) " = A(e s ). 

n - 0 n = 0 n= 0 

Thus, via a change of variable z = e s , the two transforms are more or less 
the same thing. □ 



Exercises 

3.44 Determine the Z transforms of the following sequences {a n }^L o : 

(a) a n = i (b) a n = n • 3” (c) a„ = n 2 • 2 n 

(d) a n = ( n ) = ^ 1) for n > p, = 0 for 0 < n < p (p 

is a fixed integer). 
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3.45 Determine the sequence a = {a n }£L 0 ? if its Z transform is (a) A(z) 

3z~-2’ ^4 

3.46 Determine the numbers a n and 6 n , n = 0, 1, 2, . . ., if ao = 0, bo = 1 and 



An - (-1 — 272 , 

O'n “I - b n ~ f-i = 1, 



3.47 Find the numbers a n and 6 n , n = 0, 1, 2, . . ., if ao = 0, &o = 1 and 



f n n + i 

1 CLn 



+ b n — 2, 

bn + 1 — 0 ) 



72 = 0, 1, 2, 



3.48 Find a n , 72 = 0, 1, 2, . . ., such that ao = ai = 0 and a n + 2 — 3a n +i + 2a n = 
1 - 2 t 2 for 72 = 0, 1, 2, . . .. 

3.49 Find a n , 72 = 0, 1, 2, . . ., if ao = ai = 0 and 

a n + 2 + 2a n +i + a n = (-l) n 72, 72 = 0, 1, 2, 

3.50 Find a n , n = 0, 1, 2, . . ., if ao = 1, ai = 3 and a n +2 + a n = 2 t2 + 4 when 
72 > 0. 

3.51 Determine the numbers y(t) for t = 0, 1, 2, ... , so that 



b* -*> ^ »(« 



3.52 Find a n for n > 0, if ao = 0 and ka n -k — a n + 1 = 2 n for 72 > 0. 

fc =0 

3.53 Determine x(n) for n = 0, 1, 2, . . ., so that 



c(n) + 2 ^^(72 — k) x(k) = 2 n , 72 = 0,1,2,.. 



3.7 Applications in control theory 



We return to the a black box” of Sec. 3.4 (p. 53). Such a box can often be 
described by a differential equation of the type P(D)y(t) = x(t), where x is 
the input and y the output. If x(t) is taken to be a unit pulse, x(t) = 8(t), 
the solution y(t) with y(t) = 0 for t < 0 is called the pulse response , or 
impulse response , of the black box. The pulse response is the same thing as 
the fundamental solution. In the general case, Laplace transformation will 
give P(s)y(s) = 1 and thus y(s) = 1/P(s). The function 



G(s) 



1 

W) 
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is called the transfer function of the box. When solving the general problem 
P(D)y(t) = x(t), y(t) = 0 for t < 0, 

Laplace transformation will now result in 

P(s)y(s) - x(s ) 



or 

y(s) = G(s)x(s). 

This formula is actually the Laplace transform of the convolution formula 
(3.8) of page 53. It provides a quick way of finding the outsignal y to 
any insignal x. The function g in the convolution is actually the impulse 
response. 

In control theory, great importance is attached to the notion of stability. 
A black box is stable , if its impulse response is transient , i.e., g(t) tends 
to zero as time goes by. This means that disturbances in the input will 
affect the output only for a short time and will not accumulate. If P(s ) is 
a polynomial, the impulse response will be transient if and only if all its 
zeroes have negative real parts. 

Example 3.29. The polynomial Pi(s) = s 2 + 2s + 2 has zeroes s = 
—l±i. Both have real part —1, so that the device described by the equation 
y" +2y' +2y = x(t) is stable. In contrast, the polynomial P 2 (s) = s 2 +2s — 1 
has zeroes s = -1 ± V2. One of these is positive, which implies that the 
corresponding black box is unstable. Finally, the polynomial Ps(s) = s 2 + 1 
has zeroes s = ±i. These have real part zero; the impulse reponse is g(t) = 
sint, which is not transient. The situation is considered as unstable. (It is 
unstable also inasmuch as a small disturbance of the coefficients of Pz(s) 
can cause the zeroes to move into the right half-plane, which gives rise to 
exponentially growing solutions.) □ 

So far, we have assumed that the black box is described in continuous 
time. In the real world, it is often more realistic to assume that time is 
discrete, i.e., that input and output are sampled at equidistant points in 
time. For simplicity, we assume that the sampling is done at t = 0, 1, 2, . . ., 
and that the input signal x(t) and the output y(t) are both zero for t < 0. 
Then, of course, the Z transform is the adequate tool. 

A black box is often described by a difference equation of the type 

y(t+k)+a,k-iy(t+k-l)-\ \-a2y(t+2)+aiy(t+l)+ a 0 y(t) =x(t), t e Z. 

(3.12) 

We introduce the characteristic polynomial 

P(z) = + ajs—iz^ 1 + • • • + a 2 Z 2 + a\Z + a 0 . 

We assumed that x(t) and y(t) were both zero for negative t. Putting 
t = — k in (3.12), we find that 

2/(0) = x(-k) - a k -iy(-l) aiy{-k + 1) - a 0 y(-k), 
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which implies that also y(0) = 0. Consequently, putting t = — k -f 1, also 
y( 1) = 0, and so on. Not until we have an x(t) that is different from 
zero do we find a y(t + k) different from zero. Thus we have initial values 
y( 0) = • • • = y(k — 1) = 0. By the rules for the Z transform, we can then 
easily transform the equation (3.12). With obvious notation we get 

P(z)Y(z) = X(z). 



Thus, 

Y (z) = ^ = C(z) x ( z ), 

where G(z) = 1/P(z) is the transfer function. Just as in the previous 
situation, it is also the impulse response, because it is the output resulting 
from inputting the signal 

S(t) = 1 for t = 0, S(t) = 0 otherwise. 

The stability of equation (3.12) hinges on the localization of the zeroes 
of the polynomial P(z). As can be seen from a table of Z transforms, a 
zero a of P(z) implies that the solution contains terms involving af. Thus 
we have stability precisely if all the zeroes of P(z) are in the interior of the 
unit disc \z\ < 1. 

Example 3.30. The difference equation y(t + 2) + ^y(t + l) + \y(t) = x(t) 

has P(z) = z 2 + \z+ \ with zeroes z = —\±^i. These satisfy \z\ = \ < 1, 
so that the equation is stable. The equation 

y{t + 3) + 2 y(t + 2) - y{t + 1) + 2 y(t) = x(t) 

is unstable. This can be seen from the constant term (= 2) of the char- 
acteristic polynomial; as is well known, this term is (plus or minus) the 
product of the zeroes, which implies that these cannot all be of modulus 
less than one. □ 

More sophisticated methods for localizing the zeroes of polynomials can 
be found in the literature on complex analysis and in books dealing with 
these applications. 

Exercises 

3.54 Investigate the stability of the following equations: 

(a) y" H-2y' H- 3y = x(t), (b) y" , +3y ,/ +3j/ , +y = x(t), (c) y"-h4y = x(t). 

3.55 Are these difference equations stable or unstable? 

(a) 2 y(t + 2) - 2 y(t + 1) + y(t) = x(t), 

(b) y(t + 2) - y(t + 1) + y(t) = x(t), 

(c) 2 y(t + 3) - y(t + 2) + 3 y(t + 1) + 3 y(t) = x(t). 
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Summary of Chapter 3 



To provide an overview of the results of this chapter, we collect the main defini- 
tions and theorems here. The precise details of the conditions for the validity of 
the results are sometimes indicated rather sketchily. Thus, this summary should 
serve as a memory refresher. Details should be looked up in the core of the text. 
Facts that rather belong in a table of transforms, such as rules of computation, 
are not included here, but can be found at the end of the book (p. 247 ff). 

Definition 

If f(t) is defined for t E R and f(t) = 0 for t < 0, its Laplace transform is 
defined by 

~ r°° 

f(s ) = / f{t)e~ st dt, 

Jo 

provided the integral is abolutely convergent for some value of s. 

Theorem 

For / to exist it is sufficient that / grows at most exponentially, i.e., that 
|/(£)| < Me kt for some constants M and k. 

Theorem 

If f(s) = g(s) for all (sufficiently large) s, then f(t) = g(t) for all t where 
both / and g are continuous. 

Theorem 

If we define the convolution h = f * g by 



MO = / * MO = [ f(fi- u )9{u) du= [ 
Jo Jo 



f(u)g(t - u) du, 



then its Laplace transform is h = fg. 

Definition 

If {a n }£L 0 is a sequence of numbers, its zeta transform is defined by 

oo 

M z ) = 'Y^a n z~ n , 

71=0 

provided the series is convergent for some value of z. This holds if \a n \ < 
MR n for some constants M and R. 



Historical notes 

The Laplace transform is, not surprisingly, found in the works of Pierre Simon 
de Laplace, notably his Theorie analytique des probability of 1812. In this book, 
he made free use of Laplace transforms and also generating functions (which are 
related to the Z transform) in a way that baffled his contemporaries. During the 
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nineteenth century, the technique was developed further, and also influenced by 
similar ideas such as the “operational calculus” of Oliver Heaviside (British 
physicist and applied mathematician, 1850-1925). With the development of mod- 
ern technology in computing and control theory, the importance of these methods 
has grown enormously. 



Problems for Chapter 3 



3.56 Solve the system y' — 2z = (1 — t)e t , z' -f 2y = 2te *, t > 0, with initial 
conditions t/(0) = 0, z(0) = 1. 

3.57 Solve the problem y" + 2 y' + 2y = 5e*, t > 0; t/(0) = 1, y'( 0) = 0. 

3.58 Solve the problem y ,n + y" + y f — 3y = 1, t > 0, when y( 0) = y'( 0) = 0, 
2 /'( 0 ) = 1 . 

3.59 Solve the problem y" + 4 y — /(£), t > 0; y(0) = 0, y( 0) = 1, where 



m 



(t~ i) 2 , t > i 
0, 0 < t < 1. 



3.60 Find y = y(t) for t > 0 that solves y" — Ay' +by = y( 0) = 2, y f ( 0) = 0, 
where (p(t ) = 0 for t < 2, <p(t) = 5 for t > 2. 

3.61 Find f(t) for t > 0, such that /( 0) = 1 and 

8 f f(t- u) e~ u du + f'(t) - 3 f(t) + 2e _t = 0, t > 0. 

Jo 

3.62 Let / be the function described by 

f(t) = 0, t < 0; f(t) = t, 0 < t < 1; f(t) = 1, t > 1. 

Solve the differential equation y"(t) + y(t) = f(t) with initial values y(0) = 
0, y\ 0) = 1. 

3.63 Solve y m + y f = t- 1, y( 0) = 2, t/(0) - y”{ 0) = 0. 

3.64 Solve y ,n + 3 y" + 3y' + y = t -f 3, t > 0; y( 0) = 0, y'( 0) = 1, y"{ 0) = 2. 

3.65 Solve the initial value problem 

f 2 "-2/' = e -t , . n 2/(0) =0, 2/(0) = 1; 

\y"+y' + z' + z = 0, * ’ z(0)=0, /(0) = -1. 



3.66 Find / such that /( 0) = 1 and 




(t — u) e u f(u) du + f'(t) + 2t 2 e t 



= 0. 



3.67 Solve the problem 

( y"(t) + 2z'(t) - y(t) = 4e 4 , 2/(0) = 0, y'(0) = 2, 

\z"(t)-2rf(t)-z(t) = 0, ’ z(0)=z'(0) = 0. 
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3.68 Solve y'" + y" + y' + y = 4e"‘, t > 0; 2 /( 0 ) = 0, y'(0) = 3, y"(0) - - 6 . 

3.69 Find / that solves 

f f(u)(t — u) sin(t — u)du — 2 f(t) = 12 e -t , t > 0 ; /( 0 ) = 6 . 

Jo 

3.70 Solve 2 /" + 3 y" + 2 /' - by = 0, t > 0; 2 /( 0 ) = 1, y'(0) = -2, y"(0) = 3. 

3.71 Solve 2 / /// (^) + 2 / ,; (t) + 42 /(£) + 42 / (t) = 8 £ + 4, t > 0, with initial values 
2/(0) = —1, 2/'(0) = 4, y"(0) = 0. 

3.72 Solve 2 /'" + y" + y' + y = 2e~*, t > 0; 2 /( 0 ) = 0 , y'(O) = 2 , y"(0) = - 2 . 

3.73 Find a solution y = y(t) for t > 0 to the initial value problem y" + 2ty' — 
Ay = 1, 2 /( 0 ) = 2 /'( 0 ) = 0. 

3.74 Find a solution of the partial differential equation utt -f 2 ut + xu x + u = xt 
for x > 0 , t > 0 , such that n(x, 0 ) = u t (x , 0 ) = 0 for x > 0 and it( 0 , t) = 0 
for t > 0 . 

3.75 Use Laplace transformation to find a solution of 

y" (t) - ty' (t) + y(t) = 5, t> 0; 2/(0) = 5, 2 /'( 0 ) = 3. 

3.76 Find / such that f(t) — 0 for t < 0 and 

5e _t f e y cos 2(t - y) f(y) dy = f'(t) + f(t) - e~\ t > 0. 

Jo 

3.77 Solve the integral equation y(t) + f*(t — u) y(u) du = 3sin2t. 

3.78 Solve the difference equation a n + 2 — 2a n +i + a n = 5 n for n > 0 with initial 
values ao = ai = 0 and right-hand member (a) b n = 1 , (b) 6 n = e n , 
(c) 60 = 1 , b n = 0 for n > 0 . 




4 

Fourier series 



4.1 Definitions 

We are going to solve, as far as we can, the approximation problem that 
was presented in Sec. 1.4. The strategy will perhaps appear somewhat 
surprising: starting from a function /, we shall define a certain series, and 
in due time we shall find that the function can be recovered from this series 
in various ways. 

All functions that we consider will have period 2i r. The whole theory 
could just as well be carried through for functions having some other period. 
This is equivalent to the standard case that we treat, via a simple linear 
transformation of the independent variable. The formulae that hold in the 
general case are collected in Sec. 4.5. 

A function defined on R with period 2n can alternatively be thought of 
as defined on the unit circle T, the variable being the polar coordinate. 
We shall frequently take this point of view. For example, the integral of / 
over an interval of one period can be written f T f(t) dt. When we want to 
compute this integral, we can choose any convenient period interval for the 
actual calculations: 



27r pa+2'K 



p pTT pZ'K pi 

J T J— 7r Jo Ja 



a e R. 



(If T is viewed as a circle, the integral f T f(t) dt is not to be considered as a 
line integral of the sort used to calculate amounts of work in mechanics, or 
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that appears in complex analysis. Instead, it is a line integral with respect 
to arc length.) 

One must be careful when working on T and speaking of notions such as 
continuity. The statement that / E C(T) must mean that / is continuous at 
all points of the circle. If we switch to viewing / as a 27r-periodic function, 
this function must also be continuous. The formula f(t) = t for — n < t < 7 r, 
for instance, defines a function that cannot be made continuous on T: at 
the point on T that corresponds to t = ±7r, the limits of f(t) from different 
directions are different. 

Similar care must be taken when speaking of functions belonging to 
C k ( T), i.e., having continuous derivatives of orders up to and including 
k. As an example, the definition g(t) = £ 2 , \t\ < 7 r, describes a function 
that is in C(T), but not in C fl (T). The first derivative does not exist at 
t = ± 7 r. This can be seen graphically by drawing the periodic continuation, 
which has corners at these points (sketch a picture!). 

Let us now do a preparatory maneuver. Suppose that a function / is the 
sum of a series 

00 

/(*)= £ c n e int — ^ c n e int . (4.1) 

n= — 00 n£ Z 

We assume that the coefficients c n are complex numbers such that 

^2 i c «i < 00. 

ne z 

By the Weierstrass M-test, the series actually converges absolutely and 
uniformly, since \e lnt \ is always equal to 1. Each term of the series is con- 
tinuous and has period 27r, and the sum function / inherits both these 
properties. 

Now let m be any integer (positive, negative, or zero), and multiply the 
series by e~ irnt . It will still converge uniformly, and it can be integrated 
term by term over a period, such as the interval ( — 7r, 7 t) : 

7 r 7r 

f f{t) e~ imt dt= f J2 °n e i{n ~ m)t dt = Y,c n r e i{n ~ m)t dt. 

L L ”€Z «6Z J ~* 

But it is readily seen that 

27t, k = 0, 

0, k 7^ 0. 



/ 



e lkt dt 



It follows that all the terms in the sum vanish, except the one where n—m = 
0, which is the same thing asn = m, and the result is that 




e imt dt = 27rc m . 
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Thus, for an absolutely convergent series of the form (4.1), the coefficients 
can be computed from the sum function using this formula. This fact can 
be taken as a motivation for the following definition. 

Definition 4.1 Let f be a function with period 2i r that is absolutely Rie- 
mann-integrable over a period. Define the numbers c n , n E Z, by 

Cn = i kJ T m e ~ int dt= hf m e ~ int dt 

These numbers are called the Fourier coefficients of f, and the Fourier 
series of f is the series 

Y,c n e int . 

ne z 

Notice that the definition does not state anything about the convergence 
of the series, even less what its sum might be if it happens to converge. It 
is the main task of this chapter to investigate these questions. 

When dealing simultaneously with several functions and their Fourier 
coefficients it is convenient to indicate to what function the coefficients be- 
long by writing things like c n (f). Another commonly used way of denoting 
the Fourier coefficients of / is /(n). 

When we want to state, as a formula, that / has a certain Fourier series, 
we write 

m~'£c n e int . 

ne z 

This means nothing more or less than the fact that the numbers c n are 
computable from / using certain integrals. 

There are a number of alternative ways of writing the terms in a Fourier 
series. For instance, when dealing with real-valued functions, the complex- 
valued functions e int are often felt to be rather “unnatural.” One can then 
write e int = cos nt + i sin nt and reshape the two terms corresponding to 
±n like this: 

c n e int + c- n e~ int = c n (cosnt + i slant) -f c_ n (cosnt — i slant) 

= (c n + c_ n ) cos nt -\-i(c n — C- n ) sin nt = a n cos nt + b n sin nt, 

n = 1,2,.... 

In the special case n — 0 we have only one term, Co. This gives a series of 
the form 

oo 

Co + n cos nt + b n sin nt ) . 

71=1 

The coefficients in this series are given by new integral formulae: 
an = Cn + c- n = J f(t)e~ mt dt + J /(£)e m * dt 
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int + e -inU df . 



= \f mw 

7T J T 

and similarly one shows that 



\ f m 

K J T 



cos ntdt, n = 1,2, 3, , 



b n = — [ f(t)sinntdt , n — 1,2,3, 

^ Jt 

If we extend the validity of the formula for a n to n = 0, we find that 
ao = 2 cq. For this reason the Fourier series is commonly written 



oo 

/(f) ~ |a 0 + ^(a n cosnt + b n sin nt). (4.2) 

71=1 



This is sometimes called the “real” or trigonometric version of the Fourier 
series for /. It should be stressed that this is nothing but a different way 
of writing the series — it is really the same series as in the definition. 

The terms in the series (4.2) can be interpreted as vibrations of differ- 
ent frequencies. The constant term \ ao is a “DC component,” the term 
a\ cos t + bi sint has period 27 r, the term with n — 2 has half the period 
length, for n = 3 the period is one-third of 27T, etc. These terms can be 
written in yet another way, that emphasizes this physical interpretation. 
The reader should be familiar with the fact that the sum of a cosine and 
a sine with the same period can always be rewritten as a single cosine (or 
sine) function with a phase angle: 

a cos nt + b sin nt = \J a 2 + 6 2 ( . a cos nt H — ; . ^ „ sin nt ) 

W a 2 + b 2 \/a 2 + b ' 2 J 

= \/ a 2 + b 2 (cos a cos nt -F sin a sin nt) = y a 2 + b 2 cos {nt — a), 

where the phase angle a is a number such that cos a = a/ y/a 2 + 6 2 , sin a = 
b/\/a 2 + b 2 . This means that (4.2) can be written in the form 

oo 

A n cos (nt — a n ). (4.3) 

71=0 

This is sometimes called the physical version of the Fourier series. In this 
formula one can immediately see the amplitude A n of each partial fre- 
quency. In this text, however, we shall not work with this form of the 
series, since it is slightly unwieldy from a mathematical point of view. 

When asked to compute the Fourier series of a specific function, it is 
normally up to the reader to choose what version to work with. This is 
illustrated by the following examples. 

Example 4.1. Define / by saying that f(t ) = e* for —7 r < t < n and 
f(t + 27 r) = f(t) for all t. (This leaves f(t) undefined for t = (2n + l)7r, 




4.1 Definitions 77 




but this does not matter. The value of a function at one point or another 
does not affect the values of its Fourier coefficients!) We get a function with 
period 27 r (see Figure 4.1). Its Fourier coefficients are 




g7T — in'K g — 7T+m7T 

27r(l — in) 



1 r e (l -in)t dt 

2tT J-n 

(— l) n (e ,r — e~ v ) 
27r(l — in) 



1 

27T 



e (l-in)t 

1 — in 



J t= — TT 



(— l) n sinh7r 
7r(l — in) 



Here we used the fact that e ±m7r = (— l) n . Now we can write 



m ~ i £ 

ne z 



(-l) n sinh7r , 

: : e 

1 — m 



sinh7r y' (-!)" e int 

7 r ^ 1 — in 

ne z 



□ 

We remind the reader of a couple of notions of symmetry that turn out 
to be useful in connection with Fourier series. A function / defined on R 
is said to be even , if f(—t) = f(t) for all t G R. A function / is odd, 
if f(—t) = (The terms should bring to mind the special function 

f(t) = £ n , which is even if n is an even integer, odd if n is an odd integer.) 
An odd function / on a symmetric interval (—a, a) has the property that 
the integral over (—a, a) is equal to zero. This has useful consequences 
for the so-called real Fourier coefficients a n and b n . If / is even and has 
period 27r, the sine coefficients b n will be zero, and furthermore the cosine 
coefficients will be given by the formula 

2 r 

f even => a n = - / f(t) cos ntdt. 

ft Jo 

In an analogous way, an odd function has all cosine coefficients equal to 
zero, and its sine coefficients are given by 

2 r 

f odd => b n = — / f(t)sinntdt. 
ft Jo 
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FIGURE 4.2. 



When computing the Fourier series for an even or odd function these facts 
are often useful. 



Example 4.2. Let / be an odd function with period 27r, that satisfies 
f(t) = (n - t)/2 for 0 < t < 7r. Find its Fourier series! (See Figure 4.2.) 

Solution. Notice that the description as given actually determines the func- 
tion completely (except for its value at one point in each period, which does 
not matter). Because the function is odd we have a n = 0 and 



Thus, 



_2 r 

7T Jo 



7 T — t 



1 
7 r 

1 1 



(n-t) 



sin nt dt 



- cos nt 



n n* 7r 



n 

sin nt 



+ 



J t=o 

7T _ 1 

J t=o n 






cos nt dt 



m ~ E 



sin nt 



n= 1 



□ 



Example 4.3. Let f(t) = t 2 for \t\ < n and define / outside of this 
interval by proclaiming it to have period 27r (draw a picture!). Find the 
Fourier series of this function. 



Solution. Now the function is even, and so b n = 0 and 

2 r 2 . ^ 2 I" 2 sinntl 77 2 f n . _ 

a n = — t cos ntdt = — t / 2t sin nt dt 

t r Jo a* n o nn J 0 



4 ^ — cos nt 

77-7T n 



l'-4- r 

J 0 n ^ J o 



, 47TCOSn7T ^ 4(-l) r 

1 • cos nt dt = z 0 = 






For n = 0 we must do a separate calculation: 



ao 



_ 2 r 
7T 7 0 



2 7 2 7T j 

* dt = - = 

it 3 



2tt 2 
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Collecting the results we get 






3 







cos nt. 



The series obtained in Example 4.3 is clearly convergent; indeed it even 
converges uniformly, by Weierstrass. At this stage we cannot tell what its 
sum is. The goal of the next few sections is to investigate this. For the 
moment, we can notice two facts about Fourier coefficients: 

Lemma 4.1 Suppose that f is as in the definition of Fourier series. Then 

1. The sequence of Fourier coefficients is bounded; more precisely , 

\c„\ < F J \f(t)\dt for all n. 

2. The Fourier coefficients tend to zero as \n\ -» oo. 

Proof. For the c n we have 

* = !/(*)! = 

where M is a fixed number that does not depend on n. (In just the same 
way one can estimate a n and b n .) The second assertion of the lemma is just 
a case of Riemann-Lebesgue’s lemma. □ 

The constant term in a Fourier series is of particular interest: 




This can be interpreted as the mean value of the function / over one period 
(or over T). This can often be useful in problem-solving. It is also intuitively 
reasonable in that all the other terms of the series have mean value 0 over 
any period (think of the graph of, say, sin nt). 

Exercises 

4.1 Prove the formulae c n = |(a n - ib n ) and c_ n = l(a n + ib n ) for n > 0 
(where b 0 = 0). 

4.2 Assume that / and g are odd functions and h is even. Find out which of 
the following functions are odd or even: / + g, fg , fh, / 2 , / + h. 

4.3 Show that an arbitrary function / on a symmetric interval (—a, a) can be 
decomposed as fs + /o, where fs is even and fo is odd. Also show that 
this decomposition is unique. Hint: put /#(£) = (/(t) + f(—t))/ 2. 



h I'M'-"* 



—int | 
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4.4 Determine the Fourier series of the 27r-periodic function described by f (t) = 
t + 1 for |t| < 7 r. 

4.5 Prove the following relations for a (continuous) function / and its “com- 
plex” Fourier coefficients c n : 

(a) If / is even, then c n = c_ n for all n. 

(b) If / is odd, then c n = — c_ n for all n. 

(c) If / is real- valued, then = c_ n for all n (where denotes complex 
conjugation). 

4.6 Find the Fourier series (in the “real” version) of the functions (a) f(t) = 
cos 2 1, (b) g(t ) = cos 2 t, (c) h(t) = sin 3 t. Sens moral? 

4.7 Let / have the Fourier coefficients {c n }. Prove the following rules for 
Fourier coefficients (F.c.’s): 

(a) Let a E Z. Then the function t e iat f(t) has F.c.’s {c n - a }. 

(b) Let b G R. Then the function t f(t — b) has F.c.’s {e~ tnb c n }- 

4.8 Find the Fourier series of h(t) — e 3lt f(t — 4), when / has period 2tt and 
satisfies f(t) = 1 for \t\ < 2, f(t ) = 0 for 2 < \t\ < ir. 

4.9 Compute the Fourier series of /, where f(t) = e”^, \t\ < 7r, f(t + 2 tt) = 
f(t), t e R. 

4.10 Let / and g be defined on T with Fourier coefficients c n (/) resp. c n (g). 
Define the function h by 

h(t) = J f(t-u) g(u) du. 

Show that h is welldefined on T (i.e., h has also period 27r), and prove that 
c n (h) = c n (f) Cn(g)- (The function h is called the convolution of / and g.) 



4.2 Dirichlet’s and Fejer’s kernels; uniqueness 

It is a regrettable fact that a Fourier series need not be convergent. For 
example, it is possible to construct a continuous function such that its 
Fourier series diverges at a specified point (see, for example, the book by 
Thomas Korner mentioned in the bibliography). We shall see, in due 
time, that if we impose somewhat harder requirements on the function, 
such as differentiability, the results are more positive. 

It is, however, true that the Fourier series of a continuous function is 
Cesaro summable to the values of the function, and this is the main result 
of this section. 

We start by establishing a closed formula for the partial sums of a Fourier 
series. To this end we shall use the following formula: 

Lemma 4.2 



Dn{u) — 
7 r 




sin(iV + \)u 
2n sin \ u 
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Proof. The equality of the two sums follows easily from Euler’s formulae. 
Let us then start from the “complex” version of the sum and compute it 
as a finite geometric sum: 

N 2N i p i{2N+l)u 

2ttD n (u)= Y, e inu = e- iNu Y einu = e ~ Wu — 

n=—N n = 0 

_ e i(N + ±)u( e -i(N +i )u _ e i(Af+i)«) 

^ p iu/2 ^ p —iu/2 p iu/2^j 

e -iNu+i(N+±)u _2i s in(AT 4 - P)u sin(iV + \)u 
e m / 2 —2 i sin ~u sin | it 



□ 

The function is called the DlRlCHLET kernel. Its graph is shown in 
Figure 4.3 on page 87. 

When discussing the convergence of Fourier series, the natural partial 
sums are those containing all frequencies up to a certain value. Thus we 
define the partial sum Sjv(t) to be 

N N 

sjv(t) := \ ao + ^(a n cos nt + b n sin nt) — ^ c n e int . 

n = 1 n=—N 



Using the Dirichlet kernel we can obtain an integral formula for this sum, 
assuming the c n to be the Fourier coefficients of a function /: 



1 du- e 



s N (t)= y c * eint = e h r /(«) 

n=-N n=-N Z J -7r 

= “ / f( u )'h Y e%n{t ~ u)du = [ f( u ) D N (t — u) 

n J n =—N J - * 

If*., . sin(iV + 

= «-/ 



du 



sin (N +_|)u ^ 



In the last step we change the variable (t — u is replaced by u) and make 
use of the periodicity of the integrand. We shall presently take another step 
and form the arithmetic means of the N + 1 first partial sums. To achieve 
this we need a formula for the mean of the corresponding Dirichlet kernels: 

Lemma 4.3 



1 

N + 1 



N 



^ ^ D n (u) 

71=0 



1 

2ir(N + 1) 



sin 1(N + l)u 
sin \u 



Fn{u) := 



2 
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The proof can be done in a way similar to Lemma 4.2 (or in some other 
way). It is left as an exercise. The function F^ft) is called the Fejer kernel. 



Now we can form the mean of the partial sums: 

N 



&N(t) = 



Sp (t) + Si(t) H h S]v(t) 

N + l 



= aFXT £ / -W _ u ) Dn ^> du 

n=0 J -* 

/ 7 r 1 ^ /* 7 r 

/(f - • N — ^ ^2 Dn(u) du = j f(t-u)F N (u)du. 



Lemma 4.4 The Fejer kernel Fn{u) has the following properties: 
1. Fm is an even function , and Fn(u) > 0. 

*. fl n F n (u) du = 1. 

3. If 5 > 0, then liin/v-»oo // Fm{u) du = 0. 



Proof Property 1 is obvious. Number 2 follows from 





|+cos u-\ hcosnu) du = 1, 



n 



0, 1, 2, . . . , iV, 



and the fact that F ^ is the mean of these Dirichlet kernels. Finally, property 
3 can be proved thus: 



a <-[ F ^ d '‘=MWTT)[ 






sin 2 \u 



< 



27r(iV+l) 



r i 

Js Sin 2 



du = 



C 6 



27r(N'+l) sin 2 \5 N + l 



►0 



as N -> oo. □ 

The lemma implies that {Fn}^ =1 is a positive summation kernel such 
as the ones studied in Sec. 2.4. Applying Corollary 2.1 we then have the 
result on Cesaro sums of Fourier series. 



Theorem 4.1 (Fejer ’s theorem) If f is piecewise continuous on T and 
continuous at the point t, then lim aw(t) = f(t). 

N-* oo 



Remark. Using the remark following Corollary 2.1, we can sharpen the result of 
the theorem a bit. If / is continuous in an interval Iq =]ao,6o[, an d I = [a, b] is 
a compact subinterval of Jo, then ON(t) will converge to f(t) uniformly on I. □ 

If a series is convergent in the traditional sense, then its sum coincides 
with the Cesaro limit. This means that if a continuous function happens to 
have a Fourier series, which is seen to be convergent, in one way or another, 
then it actually converges to the function it comes from. In particular we 
have the following theorem. 
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Theorem 4.2 If f is continuous on T and its Fourier coefficients c n are 
such that £|c n | is convergent, then the Fourier series is convergent with 
sum f(t ) for all t G T, and the convergence is even uniform on T. 

The uniform convergence follows using the Weierstrass M-test just as at 
the beginning of this chapter. 

This result can be applied to Example 4.3 of the previous section, where 
we computed the Fourier series of f(t) = t 2 (\t\ < n). Applying the usual 
comparison test, the series obtained is easily seen to be convergent, and 
now we know that its sum is also equal to /(£). We now have this formula: 

7J-2 °° 

t 2 = h 4 y — 5 — cos nt, —7 r <t<ir. (4.4) 

3 “ n 2 ” ~ 

n= 1 



(Why does this formula hold even for t = ±n ?) In particular, we can amuse 
ourselves by inserting various values of t just to see what we get. For t = 0 
the result is 




(-1)" 



n 



2 ‘ 



From this we can conclude that 



(-1)" = 7T 2 

^ n 2 12 



If t = 7r is substituted into (4.4), we have 



7 r 



2 = ^ +4 f tir { _ ir = ^ +A y i 

3 ^ n 2 3 ^n 2 ’ 

n— 1 n= 1 



which enables us to state that 



00 i 2 

E l _ 7T 

n 2 6 

n= 1 



Thus, Fourier series provide a means of computing the sums of numeri- 
cal series. Regrettably, it can hardly be called a “method”: if one faces a 
more-or-less randomly chosen series, there is no general method to find a 
function whose Fourier expansion will help us to sum it. As an illustration 
we mention that it is rather easy to find nice expressions for the values of 



o° 

«•>-£;? 



71=1 



for s = 2, 4, 6 , . . ., but no one has so far found such an expression for, say, 

C(3). 

The following uniqueness result is also a consequence of Theorem 4.2. 
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Theorem 4.3 Suppose that f is piecewise continuous and that all its Fou- 
rier coefficients are 0. Then f(t) = 0 at all points where f is continuous. 

In fact, all the partial sums are zero and the series is trivially convergent, 
and by Theorem 4.2 it must then converge to the function from which it is 
formed. 

Corollary 4.1 If two continuous functions f and g have the same Fourier 
coefficients , then f = g. 

Proof Apply Theorem 4.3 to the function h = f — g. □ 



Exercises 

4.11 Prove the formula for the Fejer kernel (i.e., Lemma 4.3). 

4.12 Study the function f(t) = t 4 — 2i r 2 t 2 , |£| < 7r, and compute the value of 

C(4). 

4.13 Determine the Fourier series of f{t) = |cost|. Prove that the series con- 
verges uniformly to / and find the value of 






(-i)" 

An 2 — 1 



4.14 Prove converse statements to the assertions in Exercise 4.5; i.e., show that 
if / is continuous (say), we can say that 

(a) if c n = c- n for all n, then / is even; 

(b) If c n = —c- n for all n, then / is odd; 

(c) If = c- n for all n, then / is real- valued. 



4.3 Differentiable functions 

Suppose that / E C 1 (T), which means that both / and its derivative /' 
are continuous on T. We compute the Fourier coefficients of the derivative: 

7T 7T 

Cn{f ' ) = h 

— 7 r — 7T 

= T(/( 7T)(-1)” - /(-ttX-1)") + incn(f) = inc n (f). 

(The fact that / is continuous on T implies that /(- 7r) = /( n).) This means 
that if / has the Fourier series c n e lnt , then /' has the series Yl i n c n e int - 
This indeed means that the Fourier series can be differentiated termwise 
(even if we have no information at all concerning the convergence of either 
of the two series). 
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If / G C 2 ( T), the argument can be repeated, and we find that the Fourier 
series of the second derivative is XX - n 2 )c n e int . Since the Fourier coeffi- 
cients of /" are bounded, by Lemma 4.1, we conclude that \ — n 2 c n \ < M 
for some constant M, which implies that \c n \ < M/n 2 for n ^ 0. But then 
we can use Theorem 4.2 to conclude that the Fourier series of / converges 
to f(t) for all t. Here we have a first, simple, sufficient condition on the 
function / itself that ensures a nice behavior of its Fourier series. 

In the next section, we shall see that C 2 can be improved to C 1 and 
indeed even less demanding conditions. 

By iteration of the argument above, the following general result follows. 

Theorem 4.4 If f G C k (T), then \c n \ < M/\n\ k for some constant M. 

The smoother the function, the smaller the Fourier coefficients: a function 
with high differentiability contains small high-frequency components. 

The assertion of the theorem is really rather weak. Indeed, one can say 
more, which is exemplified in Exercises 4.15 and 4.17. 

The situation concerning integration of Fourier series is extremely favor- 
able. It turns out that termwise integration is always possible, both when 
talking about antiderivatives and integrals over an interval. There is one 
complication: if the constant term in the series is not zero, the formally 
integrated series is no longer a Fourier series. However, we postpone the 
treatment of these matters until later on, when it will be easier to carry 
through. (Sec. 5.4, Theorem 5.9 on p. 122.) 

The fact that termwise differentiation is possible can be used when look- 
ing for periodic solutions of differential equations and similar problems. We 
give an example of this. 

Example 4.4. Find a solution y(t) with period 2tt of the differential- 
difference equation y'{t) + 2y(t — n) = sin £, — oo < t < oo. 

Solution . Assume the solution to be the sum of a “complex” Fourier series 
(a “real” series could also be used): 

y(t) = J2c n e int . 

tiGzZ 

If we differentiate termwise and substitute into the given equation, we get 

y'(t)+2y(t-TT) = Y,inc n e int +2Y,Cne int ~ in * = ^(m+2(-l)”)c n e int . 

. ( 4 - 5 ) 

This should be equal to sint = ( e lt — e lt )/(2i) = \ ie %t — \ The 
equality must imply that the coefficients in the last series of (4.5) are zeroes 
for all n ^ ±1, and furthermore 

(*- 2)ci = -^, H-2)c_i = |. 
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From this we solve C\ — ^(2 z — 1), c _ \ — ^(—2 i — 1) (and c n = 0 for all 
other n), which gives 

y(t) = Cl e« + C —\ e~ u = -±(e u + e"«) + ^ i(e« - e"") 

= — | cost + \ i • 2zsint = |(— cost — 2sint). 

Check the solution by substituting into the original equation! □ 

Exercises 

4.15 Prove the following improvement on Theorem 4.4: If / G C fc (T), then 

lim n k c n = 0. 

n— »±oo 

4.16 Find the values of the constant a for which the problem y"(t) + ay(t) = 
y(t + 7r), t G R, has a solution with period 27r which is not identically zero. 
Also, determine all such solutions. 

4.17 Try to prove the following partial improvements on Theorem 4.4: 

(a) If f is continuous and differentiable on T except possibly for a finite 
number of jump discontinuities, then \c n \ < M/\n\ for some constant M. 

(b) If / is continuous on T and has a second derivative everywhere except 
possibly for a finite number of points, where there are “corners” (i.e., the 
left-hand and right-hand first derivatives exist but are different from each 
other), then \c n \ < M/n 2 for some constant M. 



4.4 Point wise convergence 

Time is now ripe for the formulation and proof of our most general theorem 
on the pointwise convergence of Fourier series. We have already mentioned 
that continuity of the function involved is not sufficient. Now let us assume 
that / is defined on T and continuous except possibly for a finite number of 
finite jumps. This means that / is permitted to be discontinuous at a finite 
number of points in each period, but at these points we assume that both 
the one-sided limits exist and are finite. For convenience, we introduce this 
notation for these limits: 



f{to - ) = lim /(*), /(*o+) = lim f(t). 

t/'to t\t 0 



In addition, we assume that the “generalized left-hand derivative” f L (to) 
exists: 

f L ( k ) = lim /(*» + - /(*»-) = lim !<to - “) ~ /(«»-) . 

LK h^o h u\ o -u 



If / happens to be continuous at to, this coincides with the usual left-hand 
derivative; if / has a discontinuity at to? we take care to use the left-hand 
limit instead of just writing /(to)- 
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Symmetrically, we shall also assume that the “generalized right-hand 
derivative” exists: 



fkifo) — 



lim 
h\ o+ 



/(to + h) — f{tp+) 
h 



Intuitively, the existence of these generalized derivatives amounts to the 
fact that at a jump discontinuity, the graphs of the two parts of the function 
on either side of the jump have each an end-point tangent direction. 

In Sec. 4.2 we proved the following formula for the partial sums of the 
Fourier series of f: 



«Jv(0 = ^ J 



sin(7V ■ 



\) u 



sin 



du. 



(4.6) 



What complicates matters is that the Dirichlet kernel occurring in the 
integral is not a positive summation kernel. On the contrary, it takes a lot 
of negative values, which causes a proof along the lines of Theorem 2.1 to 
fail completely (see Figure 4.3). 

We shall make use of the following formula: 



i r’ MN + j)« du = 1 

7r J o sin 



(4.7) 



This follows directly from the fact that the integrated function is 2ttDn(u) 
= 1 + 2 Yli cos nu > where all the cosine terms have integral zero over [0, 7 r] . 

We split the integral (4.6) in two parts, each covering half of the interval 
of integration, and begin by taking care of the right-hand half: 
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Lemma 4.5 



lim — 

iV— ► oo 7 T 




sin(iV + 
sin \u 



du - /(t 0 -). 



Proo/. Rewrite the difference between the integral on the left and the num- 
ber on the right, using (4.7): 



1 

7T 




/(to - «) 



sin(7V +\)u 
sin 



du - /(t 0 -) 



^ / (/(*o - «) - /(*o-)) 

7T /o 

I r /(to-u)-/(t 0 -) 

7T /o -« 



sin(jV + |)u 
sin | ix 
— u 

i — • sin(iV + i )ix du. 

sm±u z 



The last integrand consists of three factors: The first one is continuous 
(except for jumps), and it has a finite limit as u -» 0+, namely, f' L (t 0 ). 
The second factor is continuous and bounded. The product of the two 
first factors is thus a function g(u) which is clearly Riemann-integrable on 
the interval [0, 7r]. By the Riemann-Lebesgue lemma we can then conclude 
that the whole integral tends to zero as N goes to infinity, which proves 
the lemma. □ 

In just the same way one can prove that if / has a generalized right-hand 
derivative at to , then 



lim - 

N-^oo 7 T 




sin (N + \)u 
sin 



du = f(t 0 +). 



Taking the arithmetic mean of the two formulae, we have proved the con- 
vergence theorem: 



Theorem 4.5 Suppose that f has period 2n, and suppose that to is a point 
where f has one-sided limiting values and (generalized) one-sided deriva- 
tives. Then the Fourier series of f converges for t = to to the mean value 
o+) + /(to-)). In particular, if f is continuous at to, the sum of the 
series equals /(to). 

We emphasize that if / is continuous at to, the sum of the series is simply 
/(to). At a point where the function has a jump discontinuity, the sum is 
instead the mean value of the right-hand and left-hand limits. 

It is important to realize that the convergence of a Fourier series at 
a particular point is really dependent only on the local behavior of the 
function in the neighborhood of that point. This is sometimes called the 
Riemann localization principle. 
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Example 4.5. Let us return to Example 4.2 on page 78. Now we finally 
know that the series 

oo . , 

E sin nt 

n 

71=1 

is indeed convergent for all t. If, for example, t = 7t/ 2, we have sin nt equal 
to zero for all even values of n, while sin(2fc 4- l)t = (— l) k . Since / is 
continuous and has a derivative at t — 7r/2, and /( 7 t/2) = 7t/ 4, we obtain 



4 h 2k+1 



i + 1 

3^5 



+ ■ 



(In theory, this formula could be used to compute numerical approximations 
to 7 r, but the series converges so extremely slowly that it is of no practical 
use whatever.) □ 

The most comprehensive theorem concerning pointwise convergence of 
Fourier series of continuous functions was proved in 1966 by Lennart Car- 
LESON. In order to formulate it we first introduce the notion of a zero set: 
a set E C T is called a zero set if, for every e > 0, it is possible to construct 
a sequence of intervals {^n}^ on the circle, that together cover the set 
E and whose total length is less that e. 

Theorem 4.6 (Carleson’s theorem) If f is continuous on T, then its 
Fourier series converges at all points of T except possibly for a zero set. 

In fact, it is not even necessary that / be continuous; it is sufficient 
that / G L 2 (T), which will be explained in Chapter 5. The proof is very 
complicated. 

Carleson’s theorem is “best possible” in the following sense: 

Theorem 4.7 (Kahane and Katznelson) If E is a zero set on T, then 
there exists a continuous function such that its Fourier series diverges pre- 
cisely for all t G E. 



Exercises 

4.18 Define / by letting f(t) = tsint for |t| < 7 r and f(t + 27 r) = f(t) for all t. 
Determine the Fourier series of / and investigate for which values of t it 
converges to f(t). 

4.19 If f(t) = (t + 1) cost for — 7r < t < 7r, what is the sum of the Fourier series 
of / for t = 37 r? (Note that you do not have to compute the series itself!) 

4.20 The function / has period 2tt and satisfies 



m = 



t + 7T, — 7T < t < 0, 
0, 0 < t < 7T. 
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(a) Find the Fourier series of / and sketch the sum of the series on the 
interval [ — 37 t, 37t] . 

oo 

(b) Sum the series 

n= 



^ (2 n - l) 2 ' 



4.21 Let f(x ) be defined for —7 r < x < tt by f(x) = cos \x and for other values 
of x by f(x) = f(x + 2 tt). Determine the Fourier series of /. For all real x, 
investigate whether the series is convergent. Find its sum for x = n • 7r/2, 
n = 1,2,3. 

4.22 Let a be a complex number but not an integer. Determine the Fourier 
series of cos at ( |f| < tt). Use the result to prove the formula 



7T COt 7 TZ 




z) 



(“expansion into partial fractions of the cotangent”). 



4.5 Formulae for other periods 



Here we have collected the formulae for Fourier series of functions with a 
period different from 27T. It is convenient to have a notation for the half- 
period , so we assume that the period is 2 P, where P > 0: 



f(t + 2 P) = f(t) for all t £ R. 



Put Cl = 7 r/P. The number Cl could be called the fundamental angular 
frequency. A linear change of variable in the usual formulae results in the 
following set of formulae: 



/w~E £5 » c<nn< > 

n€Z 



where 



— 



1 

2 P 




dt , 



and, alternatively, 

p 

/(f) ~ | + cosnfif+6 n sin nClt), where = — / /(t) ^ nCltdt. 

n = l n p 



In all cases, the intervals of integration can be changed from (— P, P) to 
an arbitrary interval of length 2P. If / is even or odd, we have the special 
cases 



2 


f P 


f even b n — 0, a n — — 


/ f(t) cos nCltdt, 

Jo 


2 


r p 


/ odd => u n = 0, b n = — 


/ /(f) sinnfifdf. 

Jo 
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All results concerning summability, convergence, differentiability, etc., 
that we have proved in the preceding sections, will of course hold equally 
well for any period length. 

Exercises 

4.23 (a) Determine the Fourier series for the even function / with period 2 that 
satisfies f(t) = t for 0 < t < 1. 

(b) Determine the Fourier series for the odd function / with period 2 that 
satisfies f(t) = t for 0 < t < 1. 

(c) Compare the convergence properties of the series obtained in (a) and 
(b). Illuminate by drawing pictures! 

4.24 Find, in the guise of a “complex” Fourier series, a periodic solution with a 
continuous first derivative on R of the differential equation y" + y' + y = g , 
where g has period Air and g(t) = 1 for |£| < 7r, g(t) = 0 for tt < \t\ < 2ir. 

4.25 Determine a solution with period 2 of the differential-difference equation 
y'(t) + y(t - 1) = cos 2 irt. 

4.26 Compute the Fourier series of the odd function / with period 2 that satifies 
f(x) = x — x 2 for 0 < x < 1. Use the result to find the sum of the series 



^ (2 n + l) 3 ’ 



4.6 Some worked examples 



In this section we give a few more examples of the computational work that 
may occur in calculating the Fourier coefficients of a function. 

Example 4.6. Take f(t) = £cos2£ for —it < t < tt, and assume / to have 
period 27 t. First of all, we try to see if / is even or odd — indeed, it is odd. 
This means that it should be a good idea to compute the Fourier series 
in the “real” version; because all a n will be zero, and b n is given by the 
half-range integral 

2 r 

b n = — tcos2tsmntdt . 

tt Jo 

The computation is now greatly simplified by using the product formula 



sin x cos y = \ (sin(x + y) + sin(x — y )) . 
Integrating by parts, we get 

b n = — J t(sm(n + 2)t + sin (n — 2 )t) dt ( n ^ 2) 



<- 



cos(n + 2 )t cos (n — 2 )t\ 

n + 2 n — 2 ) 



Jo 



n + 2 



n — 2 
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1 f /cos(n + 2)t cos(n-2)£\ 

+— / 1 — \dt 

7r J o V n + 2 n — 2 / 



1 _/cos(n + 2)7r , cos(n-2)7r^ n /(-l) n , (-l) r ^ 

* ' + ~^2~ ) + ° = “ UT? + 7^2 I 

2n(—l) n 
n 2 — 4 

This computation fails for n = 2. For this n we get instead 



■ = \ [ t( si 
n Jo 



in At + 0) dt = - 

7T 



1 r -cos4ti 1 



+ 



j_ r 

4tt Jo 



cos 4t dt 



= _I+0 = _I 

4 ' U 4- 



Noting that &i = — | , we can conveniently describe the Fourier series as 



OO / 1 \ 71 

/(£) ~ — | sin t — | sin 2t — 2 ^ ^ — t - sin nt- 



n= 3 



□ 



Example 4.7. Find the Fourier series of the odd function of period 2 that 
is described by f(t ) = t(l — t) for 0 < t < 1. Using the result, find the value 
of the sum 



oo 



k = 0 



(2fc + l) 3 ' 



Solution. Since the function is odd, we compute a sine series. The coeffi- 
cients are 

b n = 2 



f 



t(l — t) sin nntdt = (integrations by parts) = 



4(1 - (-1)«) 



which is zero for all even values of n. Writing n = 2k + 1 when n is odd, 
we get the series 

i(t\ 8 V' sin ( 2 ^ + 1)^ 

m ~^ h > ( 2fc+1 ) 3 ' 

A sketch of the function shows that / is everywhere continuous and has both 
right- and left-hand derivatives everywhere, which permits us to replace 
the sign ~ by =. In particular we note that if t = then sin(2fc + 1 )nt = 
sin(fc + ^)ir = (-l) fc , so that 



m 



Si 



7T 

51 “ 32 



□ 
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FIGURE 4.4. 



Exercises 



4.27 Find the Fourier series of / with period 1, when /(x) = x for 1 < x < 2. 
Indicate the sum of the series for x = 0 and x = \ . Explain your answer! 

4.28 Develop into Fourier series the function / given by 

x 

/(x) = sin -, -7T < x < 7T; /(x + 2tt) = /(x), x £ R. 



4.29 Compute the Fourier series of period 2tt for the function /(x) = (|x| — 7r) 2 , 
|x| < 7 r, and use it to find the sums 



E 



77. 2 



OO 

and V' -b . 
n 2 



4.7 The Gibbs phenomenon 

Let / be a function that satisfies the conditions for pointwise convergence 
of the Fourier series (Theorem 4.5) and that has a jump discontinuity at a 
certain point to. If we draw a graph of a partial sum of the series, we discover 
a peculiar behavior: When t approaches to, for example, from the left, the 
graph of s n (t) somehow grows restless; you might say that it prepares to 
take off for the jump; and when the jump is accomplished, it overshoots 
the mark somewhat and then calms down again. Figure 4.4 shows a typical 
case. 

This sort of behavior had already been observed during the nineteenth 
century by experimental physicists, and it was then believed to be due to 
imperfection in the measuring apparatuses. The fact that this is not so, but 
that we are dealing with an actual mathematical phenomenon, was proved 
by J. W. Gibbs, after whom the behavior has also been named. 

The behavior is fundamentally due to the fact that the Dirichlet kernel 
D n (t) is restless near t = 0. We are going to analyze the matter in detail 
in one special case and then, using a simple maneuver, show that the same 
sort of thing occurs in the general case. 
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G- 



t 



FIGURE 4.5. 



Let f(t) be a so-called square- wave function with period 27 r, described 
by f(t) = 1 for 0 < t < 7r, f(t) = — 1 for — n < t < 0 (see Figure 4.5). Since 
/ is odd, it has a sine series, with coefficients 






2 2 

— / sin ntdt = — 

n Jo n 



cos nt 



n 






Jo 



rm 



which is zero if n is even. Thus, 



fit) ~ 



4 sin(2fc -j- 1 )t 

7T ^ 2k + 1 
/c=0 



4 

7T 



sin3£ sin5t 
sin t -\ 1 h • • • 

O 0 



(4.8) 



Because of symmetry we can restrict our study to the interval (0, 7 t/ 2). For 
a while we dump the factor 4/7T and consider the partial sums of the series 
in the brackets: 

S n (t) = sint + \ sin3t + l sin5t H h — - — sin(2 n + 1 )t. 

6 5 2n + l 

By differentiation we find 



S f n (t) = cos t + cos St -I h cos(2n + l)t = \ ^( e ^ 2fc+1 ^ -f e 2 ( 2fc+1 ) t ) 

k = o 

_ l.-»(2n+l)(V 1 .akt _ 1 -i(2n+l)t 1 - e »2(2n+2)t ^ S in2(n + l)t 
2 A 2 1 — e i2t 2sin t 



(compare the method that we used to sum D n (t)). The last formula does 
not hold for t = 0, but it does hold in the half-open interval 0 < t < 7r/2. 
The derivative has zeroes in this interval; they are easily found to be where 
2(n+l)£ = kTTOit = Tk — (fc7r)/((2(n+l)), k = 1, 2, . . . , n. Considering the 
sign of the derivative between the zeroes one realizes that these points are 
alternatingly maxima and minima of S n . More precisely, since S n ( 0) = 0, 
integration gives 



.w = f 

Jo 



sin2(n + 1 )u 
2sinu 






where the numerator of the integrand oscillates in a smooth fashion between 
the successive t&, while the denominator increases throughout the interval. 
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This means that the first maximum value, for t = ti, is also the largest, 
and the oscillations in S n then quiet down as t increases (see Figure 4.6). 
It follows that the maximal value of S n (t) on ]0, 7r/2] is given by 



An — $7i(Tl) — $n 



7 r 



2(n + 1) 



-E 

k = 0 



2fc + l 



sm 



(2k + 1)7 r 
2(n + 1) ‘ 



We can interpret the last sum as a Riemann sum for a certain integral: Let 
tk = kir/(n+l) and & = ^(tk+tk+i )- Then the points 0 = ... ,t n +i = 

7r describe a subdivision of the interval (0, 7 r), the point £& lies in the subin- 
terval (sCfcjEfc+i) and, in addition, = (2fc + l)7r/(2(n + 1)). Thus we 
have 



A>=aE 



/c=0 



sin^fc 



Ax k 



if 



sin# 



dx as n — ► 00 . 



A more detailed scrutiny of the limit process would show that the numbers 
A n decrease toward the limit. 

Now we reintroduce the factor 4/7r. We have then established that the 
partial sums of the Fourier series (4.8) have maximum values that tend to 
the limit 



7T Jo t 



1.1789797, 



and the maximal value of S n (t) is taken at t = 7r/(2(n + 1)). On the right- 
hand side of the maximum, the partial sums oscillate around the value 1 
with a decreasing amplitude, up to the point t = tt/2. Because of symmetry, 
the behavior to the left will be analogous. What we want to stress is the 
fact that the maximal oscillation does not tend to zero when more terms 
of the series are added; on the contrary, it stabilizes toward a value that 
is approximately 9 percent of the total size of the jump. The point where 
the maximum oscillation takes place moves indefinitely closer to the point 
of the jump. It is even possible to prove that the Fourier series is actually 
uniformly convergent to 1 on intervals of the form [a, 7r — a], where a > 0. 

Now let g be any function with a jump discontinuity at to with the size 
of the jump equal to S = g(to+) — g(to—), and assume that g satisfies the 
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FIGURE 4.7. 

conditions of Theorem 4.5 for convergence of the Fourier series in some 
neighborhood of to- Form the function h(t) = g(t) — \ Sf(t — to), where 
/ is the square-wave function just investigated. Then, /i(to+) = h(to ~ ), 
so that h is actually continuous at to if one defines h(to) in the proper 
way. Furthermore, h has left- and right-hand derivatives at to, and so the 
Fourier series of h will converge nicely to h in a neighborhood of £ = to- The 
Fourier series of g can be written as the series of h plus some multiple of a 
translate of the series of /; the former series is calm near to, but the latter 
oscillates in the manner demonstrated above. It follows that the series of g 
exhibits on the whole the same restlessness when we approach to, as does 
the series of / when we approach 0. The size of the maximum oscillation is 
also approximately 9 percent of the size of the whole jump. 

If a Fourier series is summed according to Cesaro (Theorem 4.1) or 
Poisson- Abel (see Sec. 6.3), the Gibbs phenomenon disappears completely. 
Compare the graphs of si^(t) in Figure 4.4 and cris(t) (for the same /) in 
Figure 4.7. 



4.8 *Fourier series for distributions 

We shall here consider the generalized functions of Sec. 2.6 and 2.7 and 
their Fourier series. Since the present chapter deals with objects defined on 
T, or, equivalently, periodic phenomena, we begin by considering periodic 
distributions as such. 

In this context, the Heaviside function H is not really interesting. But 
we can still think of the object S a (t) as a “unit pulse” located at a point 
a G T, having the property 

J <p(t)5 a (t) dt = (p(a) if (f is continuous at a. 

The periodic description of the same object consists of a so-called pulse 
train consisting of unit pulses at all the points a -f n • 27r, n G Z. As an 
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object defined on R, this pulse train could be described by 

oo oo 

X] 5 a+2*n{t) = 8{t-a-2im). 

n= — oo n=— oo 



The convergence of this series is uncontroversial, because at any individual 
point t at most one of the terms is different from zero. 

The derivatives of S a can be described using integration by parts, just as 
in Sec. 2.6, but now the integrals are taken over T (i.e., over one period). 
Because everything is periodic, the contributions at the ends of the interval 
will cancel: 




(p{t)8 a {t) 



- fc+27T 

- b 




dt = —ip' (a). 



What would be the Fourier series of these distributions? Let us first 
consider S a . The natural approach is to define Fourier coefficients by the 
formula 






dt 



1 

27 r 



The series then looks like this: 



g-ina 



8a(t) 



\ ^ —ina t int 

2tt 



ne z 



In particular, when a = 0, the Fourier coefficients are all equal to l/(27r), 
and the series is 



S(t) ~ 



1 

27 r 



]r 

nGZ 



By pairing terms with the same values of |n|, we can formally rewrite this 
as 

1 1 

S(t) ~ - — 1 — > cos nt. 
v 27T 7 r ^ 

n= 1 

Compare the Dirichlet kernel! We might say that 5 is the limit of Dn as 
N — > oo. 

These series cannot be convergent in the usual sense, since their terms do 
not tend to zero. But for certain values of t they can be summed according 
to Cesaro. Indeed, we can use the result of Exercise 2.16 on page 22. The 
series for 27r 5 a can be written (with z = 



OO —1 

n= 0 n=— oo 

oo oo oo 



77-GZ 



n=0 n — 1 



n=0 



n=0 
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According to Exercise 2.16, both the series in the last expression can be 
summed (C, 1) if \z\ = 1 but z ■=£ 1, which is the case if t ^ a, and the 
result will be 

1 z _l — z + z(l - z) 1 — zz 1 - \z\ 2 

1 — z + l — z |1 — z\ 2 \l-z\ 2 |1 — z\ 2 ' 

If t = a, all the terms are ones, and the series diverges to infinity. 

Thus the series behaves in a way that is most satisfactory, as it enhances 
our intuitive image of what S a looks like. 

Next we find the Fourier series of 5 f a . The coefficients are 

Cn = k f S’aV)*-™ to 

2t r v Jlt = a 

We recognize that the rule in Sec. 4.3 for the Fourier coefficients of a deriva- 
tive holds true. The summation of the series 



27 r 
in 
2n 6 



n —int 



dt 



t=a 






ne -ian e int 



ne z 



l 

27 r 



ne in{t ' a) 
ne z 



is tougher than that of S a itself, because the terms now have moduli that 
even tend to infinity as \n\ — > oo. It can be shown, however, that for t a 
the series is summable (C, 2) to 0. 

We give a couple of examples to illustrate the use of these series. 

Example 4.8. Consider the function of Example 4.2 on page 78. Its 
Fourier series can be written 



/«)- = 



71=1 



n^O 



(Notice that the last version is correct — the minus sign in the Euler 
formula for sin is incorporated in the sign of the n in the coefficient.) 

The derivative of / consists of an “ordinary” term — \ , which takes care 
of the slope between the jumps, and a pulse train that on T is identified 
with 7 t • 6(t). This would mean that the Fourier series of the derivative is 
given by 

/'(*) = ~ ^ eint 

ne z 

oo oo 

= -5 + sE ei "‘ = 5 E( eint + e ~ int ) = E cosnt 

neZ n=l 7i=l 



Notice that this is precisely what a formal differentiation of the original 
series would yield. □ 
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Example 4.9. Find a 27r-periodic solution of the differential equation 
y' + y — 1 + 6(t) (-7 r <t <tt). 

Solution. We try a solution of the form y = ^c n e mt . Differentiating this 
and expanding the right-hand member in Fourier series, we get 



or 



inc n e int + £ c n e int = 1 + £ ^ e int , 

71GZ tiGZ nGZ 



Co + y^(m + 1 )c n e mt 
n^O 




Identification of coefficients yields Co = 1 + 1/(27 t) and, for n ^ 0, c n = 
1/ (27t(1 + in)). A solution should thus be given by 




e %nt 

1 + in 



By a stroke of luck, it happens that this series has been almost encountered 
before in the text: in Example 4.1 on page 76 f. we found that 



/(«) 



sinh7r 



n^O 



m 



where f(u) = e u for — 7r < u < n and / has period 27r. From this we can 
find that 

Y] -[— 7 - e inu = f(u ) 

“ 1 — m sinh 7 r J 

n^O 



1. 



On the other hand, the series on the left of this equation can be rewritten, 
using (— l) n = e m7r and letting t = 7r — u: 



E 



(-ir 

1 — in 



ginu 



= E 

n^O 



( — e in(7r-u) 

V •*•/ g—tnu _ \ 

1 + in 1 + in 



E 



gint 

1 + in ’ 



This means that our solution can be expressed in the following way: 



y(t) 



x 



27 tJ 27r\sinh7r 
In particular, 






2 sinh 7 r ' 



2 sinh 7 r 



y{t) = 1 + 



0 7T — t 



2 sinh 7 r 



= 1 + 



-t 



2 sinh 7r 



0 < t < 27T, 



since this condition on t is equivalent to -7r < 7r — t < n. At the points 
t = n • 27 t, 7/(t) has an upward jump of size 1 (check this!). 
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Let us check the solution by substitution into the equation. Differenti- 
ating, we find that y'(t ) contains the pulse S(t) at the origin, and between 
jumps one has y'(t) = —(y(t) — 1). This proves that we have indeed found 
a solution. □ 



Exercises 



4.30 Let / be the even function with period 2i r that satisfies f(t) = ir — t for 
0 < t < 7r. Determine /' and /", and use the result to find the Fourier 
series of /. 

4.31 Let / have period 2n and satisfy 



( e*, |t| < 7r/2 
[O, 7r/2 < \t\ < IT, 



Compute /' — /, and then determine the Fourier series of /. 



Summary of Chapter 4 

Definition 

If / is a sufficiently nice function defined on T, we define its Fourier coef- 
ficients by 

Cn = if T dt or t = Ut f{t C n,dt 

The Fourier series of / is the series 

oo 

y; c n e mt , resp. |a 0 + ^ (a n cos nt + b n sin nt). 

n£ Z n=l 



If / has a period other than 27 t, the formulae have to be adjusted accord- 
ingly. If / is even or odd, the formulae for a n and b n can be simplified. 

Theorem 

If two continuous functions / and g have the same Fourier coefficients, then 

/ = 9- 

Theorem 

If / is piecewise continuous on T and continuous at the point t, then, for 
this value of £, its Fourier series is summable (C, 1) to the value f(t). 

Theorem 

If / is continuous on T and its Fourier coefficients satisfy ^ \c n \ < oo, then 
its Fourier series converges absolutely and uniformly to f{t) on all of T. 
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Theorem 

If / is differentiable on T, then the Fourier series of the derivative /' can 
be found by termvise differentiation. 

Theorem 

If / G C k ( T), then its Fourier coefficients satisfy |c n | < M/\n\ k . 

Theorem 

If / is continuous except for jump discontinuities, and if it has (generalized) 
one-sided derivatives at a point £, then its Fourier series for this value of t 
converges with the sum ^ (/(£+) + /(£—)). 

Formulae for Fourier series are found on page 251. 



Historical notes 

Joseph Fourier was not the first person to consider trigonometric series of 
the kind that came to bear his name. Around 1750, both Daniel Bernoulli and 
Leonhard Euler were busy investigating these series, but the standard of rigor in 
mathematics then was not sufficient for a real understanding of them. Part of the 
problem was the fact that the notion of a function had not been made precise, 
and different people had different opinions on this matter. For example, a graph 
pieced together as in Figure 4.2 on page 78 was not considered to represent one 
function but several. It was not until the times of Bernhard Riemann and Karl 
Weierstrass that something similar to the modern concept of a function was 
born. In 1822, when Fourier’s great treatise appeared, it was generally regarded as 
absurd that a series with terms that were smooth and nice trigonometric functions 
should be able to represent functions that were not everywhere differentiable, or 
even worse — discontinuous! 

The convergence theorem (Theorem 4.5) as stated in the text is a weaker 
version of a result by the German mathematician J. Peter Lejeune-Dirichlet 
(1805-59). At the age of 19, the Hungarian Lipot Fejer (1880-1959) had the 
bright idea of applying Cesaro summation to Fourier series. 

In the twentieth century the really hard questions concerning the convergence 
of Fourier series were finally resolved, when Lennart Carleson (1928-) proved 
his famous Theorem 4.6. The author of this book, then a graduate student, 
attended the series of seminars in the fall of 1965 when Carleson step by step 
conquered the obstacles in his way. The final proof consists of 23 packed pages in 
one of the world’s most famous mathematical journals, the Acta Mathematica. 



Problems for Chapter 4 

4.32 Determine the Fourier series of the following functions. Also state what is 
the sum of the series for all t. 

(a) f(t) = 2 + 7 cos 3t — 4 sin 2 £, — 7r < t < 7r. 

(b) /(£) = |sin£|, — 7r < t < tt. 
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(c) f(t) = (7T ~ t)(7T + t), ~7T < t < 7 T. 

(d) /(t) = e 1 * 1 , 7T < t < IT . 

4.33 Find the cosine series of f(t ) = sint, 0 < t < n. 

4.34 Find the sine series of f(t) = cost, 0 < t < 7r. Use this series to show that 

ttx/2 13 5 7 

16 ~ 2 2 — 1 6 2 - 1 + 10 2 - 1 14 2 - 1 + ” ‘ ' 

4.35 Let / be the 27r-periodic continuation of the function H(t — a) — H(t — 6), 
where —tt < a < b < 7r. Find the Fourier series of /. For what values of t 
does it converge? Indicate its sum for for all such t € [ — 7r, 7 t] . 

4.36 Let / be given by f(x) = -1 for -1 < x < 0, f(x) = x for 0 < x < 1 and 

f(x + 2) = f(x) for all x. Compute the Fourier series of /. State the sum 

of this series for x = 10, x = 10.5, and x = 11. 

4.37 Develop /(t) = t(t — 1), 0 < t < 1, period 1, in a Fourier series. Quote 
some criterion that implies that the series converges to f(t) for all values 
of t. 

4.38 The function / is defined by f(t) = t 2 for 0 < t < 1, f(t) = 0 for 1 < t < 2 
and by the statement that it has period 2. 

(a) Develop / in a Fourier series with period 2 and indicate the sum of the 
series in the interval [0, 5]. 

oo f-i) n 

(b) Compute the value of the sum s = ^2 ~ — 2 • 

n=l ^ 

4.39 Suppose that / is integrable, has period T, and Fourier series 

00 

/(t)~ c ™e 2 * int/T . 

n= — 00 

Determine the Fourier series of the so-called autocorrelation function r of 
/, which is defined by 

r (<) = /(* + u ) f( u ) du - 

4.40 An application to sound waves: Suppose the variation in pressure, p, that 
causes a sound has period ^ s (seconds), and satisfies 

p(t) = 1, 0 < t < Yo48’ p(t) = 8 ’ 1048 < ^ < 524’ 

P(t) ~ 8 ’ 524 < ^ ^ T048 ’ = 1048 ^ ^ ^ 262* 

What frequencies can be heard in this sound? Which is the dominant fre- 
quency? 

4.41 Compute the Fourier series of /, given by 

X 

f(x) = sin - , -7r < x < 7r; f(x + 27r) = /(x), x e R. 

Then find the values of the sums 

Sl =E4^TI s2 = Ei^- 
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4.42 Let / be an even function of period 2ir described by /(x) = cos2x for 
0 < x < |7r and f{x) = —1 for |7r < x < 7 r. Find its Fourier series and 
compute the value of the sum 



s = 



E 



k= 1 



(—i) k 

(2Jfc + l)(2fc-l)(2fe + 3) ' 



4.43 Find all solutions y(t) with period 27 r of the differential-difference equation 

y(t) + y(t — |7r) — y(t — 7 r) = cost, — oo < t < oo. 

4.44 Let / be an even function with period 4 such that f(x) = \—x for 0 < x < 1 
and f(x) — 0 for 1 < x < 2. Find its Fourier series and compute 

°° 1 

S = T,(2k+l)f 



4.45 Let a be a real number but not an integer. Define f(x) by putting f(x) = 
e tax for —7 r < x < n and f(x + 27 r) = f(x). By studying its Fourier series, 
prove the following formulae: 

7r 1 2(— l) n a 

sin 7ra a ^ a 2 — n 2 

n=l 



sin 7ra 



- E 



(a — n) 2 



4.46 Compute the Fourier series of the 27r-periodic function / given by f(x) = 
x 3 — 7r 2 x for —7 r <x <i r. Find the sum 



5 = 



E 



(~l) n+1 
(2 n - l) 3 ‘ 



4.47 Let / be a 27r-periodic function with (“complex”) Fourier coefficients c n 
(n € Z). Assume that for an integer k > 0 it holds that 

^|n| fc |c„| <oo. 

nez 

Prove that / is of class i.e., that the kth derivative of / is continuous. 

4.48 Find the Fourier series of / with period 2 which is given for |x| < 1 by 
f(x) = 2x 2 — x 4 . The result can be used to find the value of the sum 



CM - E 



n 4 ’ 
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5.1 Linear spaces over the complex numbers 

We assume that the reader is familiar with vector spaces, where the scalars 
are real numbers. It is also possible, and indeed very fruitful, to allow scalars 
to be complex numbers. Thus, we consider now a set V, whose elements can 
be added to each other, and also can be multiplied by complex numbers, 
to produce new elements in V ; and these operations are to obey the same 
rules of calculation as in ordinary real vector spaces. Simple examples of 
such complex vector spaces are given by C n , which consists of n-tuples 
z = (zi, Z2, . • • , z n ) of complex numbers. Another example is the set of all 
complex- valued functions / defined on an interval [a, b]. 

It turns out that elementary complex linear algebra can be developed in 
almost exact parallel to its real counterpart. Linear dependence, subspaces, 
and dimension can be defined word for word in the same way, with the un- 
derstanding that all scalars occurring in the process are complex numbers. 
This gives a new significance to the notion of dimension. For example, C 
is a one- dimensional complex vector space. It could also be considered as 
a real vector space, but then the dimension is 2 . In the same manner, C n 
has complex dimension n but real dimension 2 n. 

When we reach the notions of scalar product and distance, we must, 
however, modify the details. The old formula for the (standard) scalar 
product of two vectors z and w in n-space has the form (z, w) = z\W\ + 
Z2W2 + • • • + z n w n . This cannot be allowed to hold any more, because it 
does not work in a reasonable way. 
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Example 5.1. Consider the vector z = (l,i) G C 2 . With the usual 
formula for the scalar product we would get |z| 2 = l*l + 2 *z = l — 1 = 0. 
The length of the vector would be 0, which is no good. Still more strange 
would be the case for the vector w = ( 1 , 22 ), which would not even have 
real length. □ 

By considering the one-dimensional space C, we can get an idea for a 
more suitable definition. The vector, or number, z = x+iy is normally iden- 
tified with the vector (x, y) in R 2 . The length of this vector is the number 
\z\ = yjx 2 +y 2 = y/&. If the scalar product of two complex numbers z 
and w is defined to be (z,w) = zw , we get the formula \z \ 2 = (z,z). This 
means that we have to modify a few rules for the scalar product, but that 
price is well worth paying. Just to indicate that the rules of computation 
are no longer exactly the same, we choose to use the name inner product 
instead of scalar product. 

Definition 5.1 Let V be a complex vector space. An inner product on V 
is a complex-valued function (u, v) of u and v G V having the following 
properties: 



(1) (u,v) = (v,u) (Hermitian symmetry) 

(2) (cm + /?v, w) = a(u , w) + (3(v, w) (Linearity in the first argument) 

(3) (i i,u) > 0 

(4) (u, u) = 0 => u = 0 



( Positive- definiteness ) 



Combining rules 1 and 2 we find the rule 

(5) (ia, av + /3w) = a(u , v) + (3(u, w). 



Example 5.2. In C n , we can define an inner product (using natural 
notation) by the formula 



(z, w) = Z\W\ + Z 2 W 2 H h Z n w n . 



□ 

Example 5.3. Let C(a, b) be the set of continuous, complex- valued func- 
tions defined on the compact interval [a, b \ , and put 

if, 9)= f f{x)g(x)dx. 

J a 

The fact that this is an inner product is almost trivial, except possibly 
for condition 4. That implication follows from the fact that if a continu- 
ous function is non-negative on an interval and its integral is 0, then the 
function must indeed be identically 0 (see textbooks on calculus). □ 
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Example 5.4. More generally, let w be a fixed continuous function on 
[a, b] such that w(x) > 0, and put 



(f,g}= f 

J a 



f(x)g(x)w(x)dx. 



□ 

A complex vector space with a chosen inner product is called an inner 
product space . Because of rules 3 and 4 it is natural to define a measure of 
the size of a vector in such a space, corresponding to the length in the real 
case. One prefers to use the word norm instead of length and write 

the norm of u = |NI = yj ( u , u). 

In the case described in Example 5.3 we thus have 

ll/ll 2 = [ b \f{x)\ 2 dx. 

J a 

The following inequalities are wellknown in the real case, and they hold 
just as well in the new setting: 

Theorem 5.1 

| (u,v ) | < \\v\\ (Cauchy-Schwarz inequality) 

\\u + ^|| < INI + |H| (Triangle inequality) 



Proof, (a) If u = 0, then both members are 0 and the statement is true. 
Thus, let us assume that u ^ 0. Put a = —(v,u)/(u,u). Then, 



0 < || au + ^|| 2 = (au + v, au + v) = aa{u, u) + ol(u, v) + a{v, u) -f- N v) 

(v,u) • (u,v) . , (v,u) . . . . , . 

- N N 7 M - j-L-L (u,v) - )-L± ( VyU ) + (v,v) 



(u,u) 2 

INN 2 

(u,u) 



(u,u) 



{u,u) 



+ (v,v). 



Rearranging this we arrive at the Cauchy-Schwarz inequality, 
(b) The triangle inequality is proved using Cauchy-Schwarz: 



\\u + v\ 



= (u + V, u + v ) = INI 2 + (u, v) + (v, u) -j- |N| 2 
= |M| 2 + 2Re(u,v) + ||f|| 2 < |N| 2 + 2|(w,v)| + \\v\\ 2 

<IHI 2 + 2H imi + ihi 2 = (ihi + imi) 2 . 



Since both \\u + v|| and |NI + \\v\\ are non-negative numbers, the triangle 
inequality follows. □ 
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Two vectors u and v are called orthogonal with respect to the chosen 
inner product, if (u,v) = 0. A vector u is normed if ||it|| = 1; and an 
orthonormal , or simply ON, set of vectors consists of normed and pairwise 
orthogonal vectors. 

Example 5.5. C'(T) consists of the continuous complex- valued functions 
on the unit circle T, and let the inner product be defined by 

(/, 9 ) = ^ f( x ) 9(x) dx. 

Let ipk(x) = e lkx , k e Z. Then we have 

{<Pm, Vn) = L [ e imx e~ inx d® = L r e i(m-n)x dx = I 0 ’ m 

^ J T ^ J — 7T L L ^ ^ 



The sequence {<fk}kL - oo ^ us an orthonormal set in C( T). □ 

Just as in the case of real spaces one can show that a set of non-zero, 
pairwise orthogonal vectors is linearly independent. In a space with finite 
(complex) dimension AT, an orthonormal set can thus contain at most N 
vectors. On the other hand, there always exists such a set. Starting from 
an arbitrary basis in the space, one can use the Gram-Schmidt process to 
construct an ON basis, working precisely as in the real case. 

Example 5.6. As a reminder of the Gram-Schmidt process, we construct an 
orthonormal basis in the space of polynomials of degree at most 2, with the inner 
product 

(/> 9) = [ f(x)g{x)( 1 + x) dx. 

Jo 

The “raw material” consists of the vectors uo,ui,U 2 defined by 

Uo(x) = 1 , U\(x ) = X , U 2 (x) = X 2 . 



As a tentative first vector in the new basis we choose vo = uo, which is to be 
adjusted to have unit norm. Since 



IMI 2 = (wo, Wo) 
the normed vector is described by 

< Po(x ) = 



/■ 

Jo 



1 • 1 • (1 + x) dx ■ 



v 0 (x) 

IMI 






Next we adjust u\ so that it is orthogonal to (fo . This is done by subtracting a 
certain multiple of (fo according to the following recipe: 



Vi = Ui - (ui,(po) (fo. 



The inner product is 
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Thus, 

The squared norm of v\ is 

IHI 2 = (vi,V!) = ( (x — f ) 2 (a: + l)dx = . 

Jo 

Thus the second normed basis vector is 




The third vector is obtained by the following steps: 

V2=U 2 ~ (u 2 ,tpo)<PO - (U2,lfil)<pi, («2, <fo) = (u 2 , <fil ) = ^ V 39 , 

V2 — X — gg ^ + 55 > {V2,V2) = 2 600 ’ 

<P2 = £VM6(x 2 -§x+^). 

It is obvious that the computations quickly grow very involved, not least be- 
cause the norms turn out to be rather hideous numbers. Mostly, therefore, one is 
satisfied with just orthogonal sets, instead of orthonormal sets. □ 

There is also the following theorem, which should be proved by the 
reader: 



Theorem 5.2 Ifipi, • • • , <£jv is an ON basis in an N -dimensional inner 

N 

product space V , then every u G V can be written as u = ^ (-U, (pj)<Pj, and 

j = i 



furthermore one has 



N 

Nl 2 — ^2 I (w, ipj ) | 2 (theorem of Pythagoras) . 

3 = i 

For the inner product of two vectors one also has the following formula: 

N 

(u,v) = '^2{u,<p j )(v,<p j ). 

3 = 1 



Exercises 

5.1 Let u = (1 — 2z, 3, 2 + i) and v = (i, 1 — 3 2 , 0) be vectors in C 3 . Compute 
||u||, ||v||, and (u,v). 

5.2 Let C(-l, 1) be as in Example 5.3 above, and let / and g G C(— 1, 1) be 
described by f(x) = x + 2, g(x) = ix -f x 2 . Compute (/, g). 

5.3 Are the functions 1, x, x 2 , ..., x n , sinx, cosx, e x linearly independent, 
considered as vectors in <7(0, 1) ? (n is some positive integer.) 
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5.4 Apply the Gram-Schmidt orthogonalization procedure to construct an or- 
thogonal basis in the subspace of (7(0, 1) spanned by the polynomials 1, x 
and x 2 . 

5.5 Orthogonalize the following sets of vectors: 

(a) the vectors (1, 2, 3), (3, 1, 4) and (2, 1, 1) in C 3 ; 

(b) the functions 1, x, x 2 i (7(— 1, 1); 

(c) the functions e ~ x , xe ~ x , x 2 e~ x in (7(0, oo). 

5.6 Prove the following formula for the inner product: 

4 (u, v) = ||u -f v \\ 2 + i\\u + iv || 2 — || u — v \\ 2 — i\\u — iv\\ 2 . 



5.2 Orthogonal projections 

The reader should recall the following fact from real linear algebra: Let 
Wk}k=i be an orthonormal set in the space V , and let u be an arbitrary 
vector in V . The orthogonal projection of u on to the subspace of V spanned 
by {<Pk}k=i is the vector 



N 

P N (u) = (u, + {u, tp2)<P2 H + {u, <Pn)<PN = ^(u, Vk)Vk- 

k = 1 

This definition works just as well in the complex case, and the projection 
thus defined has also the least-squares property described in the following 
theorem. 

Theorem 5.3 (Least squares approximation) Let {(fk}k=i be an or- 
thonormal set in an inner product space V and let u be a vector in V . 
Among all the linear combinations <I> = Ylk=i IWkj the one that minimizes 
the value of\\u — 3>|| is the one with the coefficients 7 k equal to (u,(pk), i-e., 
$ = P N (u). 



Proof. For brevity, we write (u, (fk) = Ck- We get 

< N N \ 

u-'E'ykVk} 

k= 1 k= 1 / 

= (u,u) - lu, J2^fk<fk ) - \^lkVk, u\ + (^ 7 k<Pk, Yl'IkVk 



\ k = 1 / \k = 1 / \k = 1 k = 1 

N N N N 

J k (u,ip k ) - + EE lklj{<Pk,Vj) 

k = 1 k = 1 k=lj=l 

N N 

= («> «) - + 7 kCk) + 7fc7fc 



k = 1 



k = 1 
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N N 

= IMI 2 + - 7 feCfc - Ik&k + c k c k ) - \°k\ 2 

k = 1 fc=l 

= imi 2 + Y2 ~ Cfc i 2 ~ 52 mi 2 - 

/c=l k = 1 

In the last expression, we note that u is given in the formulation of the 
theorem; also, since the (fk are given, the Ck = (u, y>k) are given. But we 
have the 7^ at our disposal, to minimize the value of the expression. In 
order to make it as small as possible, we choose the numbers 7^ to be equal 
to Cfc, which proves the assertion. □ 

A couple of consequences of the proof are worth emphasizing. When a 
vector u is approximated by the orthogonal projection Pjv('u), then the 
“error” u — Pn{u) is called the residual. An important fact is that the 
residual is orthogonal to the projection. As for the “size” of the residual, 
we have the identity (“Pythagoras’ theorem”) 

ll« - -Piv(u)ll 2 = IMI 2 - ||JM«)II 2 - 



If this is written out in full, it becomes 

N 






N 



= U 



-ZlMrf 



( 5 . 1 ) 



k=l k -1 

Since the left-hand member of these equations is non-negative, we also have 

IMMMII 2 < IMI 2 , or HJWtOII < IMI, 



or, indeed, 



N 



£im^>i 2 <imi 2 

k = 1 



If the ON set contains infinitely many vectors, we can let N — >• 00 and get 
what is known as the Bessel inequality: 



ZlMvMI 2 < IMI 2 - 

k = 1 



Let us now concentrate on the case when the space V has infinite dimen- 
sion. A system {(pj } ( ^ =1 in V is said to be complete in V if, for every u E V 

and every e > 0, there exists a linear combination 1 a j ( Pj such that 



N 

u 



3 = 1 



< £. 
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This means that every u in V can be approximated arbitrarily closely, in the 
sense of the norm, by linear combinations of the elements in the set {y>j}- 
The theorem above shows that the best approximation must be given by 
the “infinite projection” Y?jLi( u i Tj)Tj °f u - A condition that is equivalent 
to completeness is given by the following theorem: 

Theorem 5.4 The ON system {^Pj}j^i is complete in V if and only if for 
every u eV it holds that 



= X]K“»¥»j>| 2 



3 = 1 



(the Parseval formula or the completeness relation). 

The proof follows from (5.1). 

As a corollary we also have the following: 



Theorem 5.5 (Expansion theorem) If the ON system is com- 

plete in V, then every u G V can be written as u = Y^jLi( u i ( Pj) l Pjf where 



the series converges in the sense of the norm (i.e., 
0 as N -» oo). 






Furthermore, one has the following theorem, which generalizes the usual 
formula (in finite dimension) for computing the inner product in an ON 
basis: 



Theorem 5.6 If the ON system {Tj}j^zi is complete in V, then 



oo 

(u,v) = 

3 = 1 



for all u, v G V. 

Proof Let P n (u) be the projection of u on to the subspace spanned by the 
n first (/?’ s: 

n 

Pn(u) = Y J ( U ’ ( Pj) l Pj ■ 

3 = 1 

By Theorem 5.2 we have 



(P n (u),P n (v)) = 

J=1 



Using the triangle and Cauchy-Schwarz inequalities, we get 
|(u,v) - {P n (u),P n (v)}\ 
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= |(u,v) - (u,P n {v)) + (u,P n (v)) - (P n (u),P n (v )) I 

< \{u,V - Pn(v))\ + |(u - P n (u),P n {v))\ 

< ||«|| ||w - P n (v ) II + ||u - P n (u ) II ||P„(u)|| 

< ||u|| ||u - P n (v) II + ||u - P n («)|| || V ||. 

In the last part we also used Bessel’s inequality. Now we know, because of 
completeness, that ||v — P n (v ) || -> 0 as n -» oo, and similarly for u , which 
implies that the final member of the estimate tends to zero. Then also the 
first member must tend to zero, and so 

oo 

(u,v) = lim (P n (u),P n (v)) = y2{u,(pj)(v,(pj), 

n— >oo L ' 

3 = 1 



and the proof is complete. 



□ 



Remark. Using an estimate of the same kind as in the proof, one can see that 
(it, v) is a continuous function of u and v in the sense that if u n -» u and v n -> v 
(in the sense of the norm), then (u n ,v n ) — > (u,v). □ 



In the interest of simplicity, we have throughout this section been working 
with ortho normal systems. In practice one is often satisfied with using 
orthogonal systems, since the normalizing factors can be quite unwieldy 
numbers. In such a case, our formulae have to be somewhat modified. 

The projection of u on to an orthogonal set of vectors {(/pj}^ =1 is given 
by 



N 



Pn(u) = ^2 

3 = 1 



(U, <Pj) 
(Vj 5 V 3 ) 



N 
3 = 1 



(u,Vj) .. 

Il^ll 2 V3 ' 



The other formulae must be adjusted in the same vein: every (pj that occurs 
has to be divided by its norm, and this holds whether pj is “free” or is part 
of an inner product. For example, the Parseval formula takes the form 



u 



viiHi Ml! 
U n^ii 2 



and the formula for inner products is 



{u,v 






i=i 



(U,<pj)(v,<fij) 

\Wi II 2 



Exercises 

5.7 Determine the polynomial p of degree at most 1 that minimizes 

f 0 \ eX ~p ( x ) | 2 dx. (Hint: first find an orthogonal basis for a suitably chosen 
space of polynomials of degree < 1.) 
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5.8 Determine the constants a and b in order to minimize the integral 
f_ | ax H- bx 2 — sin7rx| 2 dx. 

5.9 Find the polynomial p(x) of degree at most 2 that minimizes the integral 

/ tt/2 

I sin x — p(x) | 2 cos x dx. 

■7r/2 

5.3 Some examples 

We have already seen the finite-dimensional inner-product spaces C n . A 
generalization of these spaces can be constructed in the following manner. 
Let M be an arbitrary set (with finitely or infinitely many elements). Let 
l 2 (M) be the set of all functions a : M C such that 

Y K*)| 2 < oo. (5.2) 

xeM 

The fact that this defines a linear space can be proved in the following way. 
Because of the inequality 



N < §(|p| 2 + M 2 ), (5.3) 

that holds for all complex numbers p and g, the following estimate is true: 

|Aa + fib\ 2 = (A a + fib)(Xa + Jib) = |A| 2 |a| 2 + |/x| 2 16| 2 + 2Re {A Jlab} 

< |A| 2 |a| 2 + H 2 |b| 2 + 2|Aa||/i6| < 2(|A| 2 |a| 2 + |/x| 2 |5| 2 ). 

Using this, and the comparison test for positive series, one finds that if 
J2\ a ( x )\ 2 an d ^2 \K X )\ 2 are both convergent, then ^ \Xcl(x) + fib{x) \ 2 is 
also convergent. This means that linear combinations of elements in l 2 (M) 
also belong to l 2 (M). Using (5.3) we also see that if both a and b are 
members of / 2 (M), then the series 

Y, o (*) b ( x ) 

xEM 

will converge absolutely. This expression can be taken as the definition of 
an inner product (a, b). The square of the norm of a is then given by the 
left member of (5.2). 

If, as an example, we choose M = N = {0, 1, 2, . . .}, we can write a n 
instead of a(n) and get the inner product This certainly looks 

like a natural generalization of C n . 

When we gave examples of inner products in function spaces in Sec. 5.1, 
we assumed, for convenience, that all the functions were continuous. It is, 
however, often desirable to be able to work with more general functions. 
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One such class is the class of Riemann-integrable functions, which consists 
of functions that can be approximated in a certain way by so-called step 
functions; the class includes, for example, functions with a finite number of 
jumps. In order to get a really efficient theory, one should actually go still 
further and allow measurable functions in the sense of Lebesgue. However, 
this is a rather complicated step, and in this text we shall confine ourselves 
to more “ordinary” functions. At one point (p. 121), we shall, however, 
mention how the Lebesgue functions obtrude upon us. 

In what follows, it will often be of no interest whether an interval contains 
its endpoints or not. For simplicity, we shall write (a, 5), which can be 
interpreted at will to mean either [a, 6] or [a, b[, etc. 

Let I be an interval, bounded or unbounded, and let w : I — >>]0, oo[ be a 
positive, continuous, real-valued function on I. Let finally p be a number 
> 1. The set L p (I,w) is defined to consist of all (Lebesgue-measurable) 
functions / such that 




dx < oo. 



(The integral may be improper in one way or another, without this being 
indicated when we write it.) It can be proved that this defines a linear 
space: if / and g belong to it, then the same goes for all linear combinations 
af + 0g. The proof is very simple in the case p — 1: 

j\af + 0g\wdx < |a| l \f\wdx + \P\ l \g\wdx. 

For p = 2, it can be done in a way that is analogous to what was done 
for l 2 (M) above (integrals replacing sums). For other values of p it is more 
difficult, and we skip it here. 

The space L p (I,w) is called the Lebesgue space with weight function w 
and exponent p. If the weight is identically 1 on all of /, one simply writes 
L P (I). For / G L p (I,w) one can define a norm 



\\p,w 






\1 /P 

w(x) dx I , 



that can be used to introduce a notion of distance \\f — g\\ PiW between two 
functions / and g. This, in turn, gives rise to a notion of convergence : we say 
that a sequence of functions f n converges to g in L P (J, w) if \\f n — d\\p,w — >• 0 
as n — ^ oo. 

In order for this to make sense, it is necessary to modify slightly what is 
meant by equality between functions. Two functions / and g are considered 
to be equal if their mutual distance is zero; explicitly this means that 

J \f(x) - g(x)\ p w(x)dx -0. 
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This does not necessarily imply that f(x) = g(x) for all x, it means only 
that they are equal “almost everywhere”: the set where they differ is a 
so-called zero set (see the text preceding Theorem 4.6 on page 89). When 
working with “nice” functions, which are continuous except for a finite 
number of jumps, this means that the function values are actually equal 
except possibly for at a finite number of points. 

We shall almost exclusively consider the case p = 2. It can be proved 
that only in this case can there exist an inner product (f,g) in L p {I,w) 
such that the norm is recovered through the formula ||/|| 2 = (/,/). This 
inner product must be defined by 



(f,g) 






x) g(x) w(x) dx. 



Let us now look at a few important cases. 



Example 5.7. Let I = (— 7r, 7r) or, equivalently, interpret I as the unit 
circle T, and take w(x) = 1. The inner product here is almost the same as in 
Example 5.5, page 108. There we showed that the functions p n (x) = e inx 
are orthogonal in C( T), and this holds equally well in L 2 (T). Because 
the factor 1/(27 r) is now missing in the inner product, they are no longer 
normed, however, but the norm of each ip n is now y/2n. 

The formula for projections on to an orthogonal set of vectors has the 
form 



N 



PM = E 

71=1 



(/? Vn) 

(Pm ty?n) 



N 

Pn = T 

n = 1 



(/? Vn) 

M 2 



N 

Pn — ^ ^ CnPn i 
71=1 



where 



C71 — 



(f,Vn) 

lbn || 2 



1 

27T 




e inx dx = 



1 

27 r 




e~ inx dx. 



We recognize our good old Fourier coefficients. 



□ 



Example 5.8. In the same space as in the preceding example we study the 
system consisting of firstly all the functions <p n (x) = cos nx, n = 0, 1, 2, . . ., 
secondly all the functions ^(x) = sin nx, n = 1, 2, 3, — Using some suit- 
able trigonometric formula (or Euler’s formulae), it is easily proved that all 
these functions are mutually orthogonal, and furthermore (ipo,<Po) — 27r, 
while all the other members of the system have the square of the norm 
equal to 7r. Here, too, the coefficients in the orthogonal projections turn 
out to be the well-known classical Fourier coefficients. □ 



Example 5.9. In the space L 2 (0,7r), the functions p n (x) = cos nx, 
n — 0, 1, 2, . . . form an orthogonal set, and ^ n (x) = sin nx, n = 1, 2, 3, . . . 
another orthogonal set (check it!). However, as a rule, a is not orthog- 
onal to a -0 n . □ 
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Examples 5.7 and 5.8 provide a new viewpoint for considering Fourier 
series. Such series can be regarded as representations with respect to an 
orthogonal system. The general results in Sections 5.1-2 concerning such 
representations are thus valid for our Fourier expansions. As an example, 
we formulate what Theorem 5.3 purports in the case of “real” Fourier series. 

Theorem 5.7 Let f G L 2 (T) and let N be a fixed non-negative integer. 
Among all trigonometric polynomials of the form 

N 

Pn {x) = \ oo + ^ {oi n cos nx + (3 n sin nx ) , 

71 = 1 

the polynomial that minimizes the value of the integral 

J ^ \f{x) - PN(x)\ 2 dx 

is the one where the coefficients are the usual Fourier coefficients of f, viz., 
(using the ordinary notation) where a n = a n and (3 n = b n . 

We have also, for example, Bessel’s inequality, which can be written, in 
complex and real guise, respectively, as 

'F, l C «| 2 j \f( x )\ 2dx i l\ a o\ 2 +^2(\an\ 2 M b n\ 2 ) < - / \f{x)\ 2 dx. 

The question of completeness will be addressed in the next section. 

Remark. At this point, the reader will have been confronted by a number of 
different notions of convergence for sequences of functions. There is pointwise 
convergence and uniform convergence, and now we have also various versions of 
L p convergence. It is natural to ask whether there are any sensible connections 
between all these notions. 

It is trivial that uniform convergence implies pointwise convergence. If one 
works on a finite interval (a, 6) with weight 1, it is also easy to see that uni- 
form convergence implies convergence in L p (a,b). Indeed, suppose that f n — > / 
uniformly on (a, b). Then ( p > 1) 

' b \!/P / nb 

I fn{t) - f(t)\ p dt ) < ( / sup I /„(*) - f{t)\ p dt 

) \Ja (a, b) 

= sup |/„(£) - f(t) I ( [ dt\ =C sup I f„(t) - /(t)| ->■ 0 

(a, b) \J a / (a, 6) 




as n — * oo. 

In the converse direction, L p convergence does not even imply pointwise con- 
vergence. This is shown by the following example; for simplicity it is formulated 
in L 1 (0, 1), but the same functions can be used to prove the same thing in all 
L p ( 0, 1), and they are easily rescaled to suit other intervals. 
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FIGURE 5.1. 

Let fi(t) = 1 on the interval I = (0,1). Then ||/i||i = 1. Let then /2 be 1 
on (0, |) and 0 on the rest of 7; /3 is set to 1 on (|, |) and 0 otherwise; f± is 
1 on the intervals (|, 1) and (0, ^), and 0 otherwise. In general: f n is equal to 
1 on one or two intervals with total length equal to 1/n, and, in a manner of 
speaking, f n starts where f n - i ends; and if you exceed the right-hand end of 7 
you put the remainder on the far left of 7 and start again moving to the right 
(see Figure 5.1). Then it holds that ||/ n ||i = 1/n, and so f n -» 0 in the sense of 
norm convergence in L 1 ( I). 

Since the series 




n= 1 



is divergent, in the sense that its partial sums tend to plus infinity, the “piling of 
intervals” described above must run to an infinite number of “turns.” If x is an 
arbitrary point in 7, there will thus exist arbitrarily large values of n such that 
fn(x) = 1, but also arbitrarily large values of n for which / n (x) = 0. In other 
words, the sequence of numbers {/ n (x)}5?Li has no limit at all, which means that 
the sequence of functions {/ n } does not converge pointwise anywhere. 

The functions in this example are close to zero “in the mean” : when n is large, 
f n (x) is equal to zero on most of the interval. For this reason, L p convergence is 
often called convergence in the mean , more precisely in the L p mean. □ 



Exercises 

5.10 Find the three first orthonormal polynomials with respect to the inner 
product 



(/: 



\s)= r 

Jo 



f(x)g{x)xdx, 



by orthogonalizing the polynomials 1, x, ar. 

5.11 Solve the same problem as the preceding, when 



(/.»)= / f(x)g(x)e * 2 dx. 
J R 



5.12 Determine the polynomial of degree < 2, that is the best approximation to 
f(x) = y/\x\ in the space L 2 (7, w), where I = [—1, 1], w(x) = 1. 

5.13 Determine the complex numbers Cj ( j = 0, 1,2,3) in order to make the 
integral j \cq + c\ cosx + C2 cos2x + C3 cos3x — cos 4 x\ 2 dx as small as 
possible. 
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FIGURE 5.2. 

5.4 The Fourier system is complete 

We are now prepared to prove the following result, which, in a way, crowns 
the classical Fourier theory. 

Theorem 5.8 The two orthogonal systems {e mt } nG z and {cosnt, n > 
0; sin nt, n > 1} are each complete in L 2 ( T). 

Proof. We want to prove that if / E L 2 ( T), then / can be approximated 
arbitrarily well by linear combinations of elements from one of the systems. 
Since the elements of one system are simple linear combinations of elements 
from the other system, it does not matter which of the systems we consider. 
Our proof will be incomplete in one way: we only show that functions / 
that happen to be piecewise continuous can be approximated; but this is 
really just due to the fact that we have not said anything about what a 
more general / might look like. 

Thus we assume that / is piecewise continuous. It is then bounded on T, 
so that \f(t)\ < M for some constant M and all t. Now let e > 0 be given. 
By the definition of the Riemann integral, there exists a step function (i.e., 
a piecewise constant function) g such that 

Clearly we can choose g in such a way that \g(t)\ < M, and then it follows 
that 

11/ - fff = jf 1/(0 - s(t) I 2 dt < jf 2M\f(t) - g(t)\dt < 2M • ^ = e 2 , 
and so ||/ - p|| < e. 

In the next step of the proof we round off the corners of the step function 
g to obtain a C 2 function h such that || g — ft|| < e. At this point, the 
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author appeals to the reader’s willingness to accept that this is possible 
(see Figure 5.2). The function h has a Fourier series Ylj n^ xnt •> where the 
coefficients satisfy | 7 n | < C/n 2 for some C (by Theorem 4.4), which implies 
that the series converges uniformly. If we take N sufficiently large, the 
partial sum h) of this series will thus satisfy 

\h(t) - s N (t;h)\ < -|= , t€ T. 

v Z7T 

From this we conclude that 

||/i-sat(-;/i)|| 2 = J^\h(t) - s N (t;h)\ 2 dt < ^ ■ 2ir = £ 2 , 

so that \\h — sn( • ; ft) II < e. Finally, let sn be the corresponding partial sum 
of the Fourier series of the function / that we started with. Because of the 
Approximation theorem (Theorem 5.3), it is certainly true that ||/ — sn\\ < 
||/ — s/v( • ; ft)ll- Time is now ripe for combining all our approximations in 
this manner: 



11/ - Sivll < 11/ - S N { • ; /i)|| = 11/ - g + g - h + h - s N ( ■ ; h)|| 

< 11/ - ffll + IIP - fc|| + ||/i - s N ( ■ ; ft) || < £ + s + e = 3e. 



This means that / can be approximated to within 3£ by a certain linear 
combination of the functions e int , and since e can be chosen arbitrarily 
small we have proved the theorem. □ 

As a consequence we now have the Parseval formula and the formula for 
the inner product (which is also often called Parseval’s formula, sometimes 
qualified as the polarized Parseval formula). For the “complex” system these 
formulae take the form 



27 r 



L 



\f(t)\ 2 dt = |c n 
ne z 



2 



// f(t)g(t)dt=Y c ndn, 

2n Jr 7^7. 



and for the “real” system, 

i r °° 

_ / | f(t)\ 2 dt= ^|qq | 2 + y^(K| 2 + \K\ 2 ), 

71=1 

l r 00 

- f(t)g(t)dt=laoc^ + Y{a n a^ + b n (3 n )- 
ft J T ^ 1 

n — 1 

(The reader will have to figure out independently how the letters on the 
right correspond to the functions involved.) 

Example 5.10. In Sec. 4.1, we saw that the odd function / with period 
27 r that is described by f(t) = (tt — t)/2 for 0 < t < n has Fourier series 
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dt = — 
27 r 



(tt - 1 ) 



31 



Jo 



^^°(sinnt)/n. Parseval’s formula looks like this 

This provides yet another way of finding the value of ((2). □ 

Example 5.11. For f(t) = t 2 on |£| < n we had ao = 27t 2 /3, a n = 
4(— 1 ) n /n 2 for n > 1 and all 6 n = 0. Parseval’s formula becomes 






which can be solved for ((4) = 7 t 4 / 90 . □ 

Suppose that / is defined only on the interval (0,7r). Then / can be 
extended to an odd function on (— 7r,7r) by defining f(t) = for 

—7 r < t < 0 (if /( 0) happens to be defined already, this value may have 
to be changed to 0, but changing the value of a function at one point does 
not matter when dealing with Fourier series). The extended function can 
be expanded in a series containing only sine terms. Since ordinary Fourier 
series present a complete system in L 2 (— 7r,7r), / can be approximated as 
closely as we want in this space by partial sums of this series. But then / 
is also approximated in L 2 (0,7r) by the same partial sums (for the square 
of the norm in this space is exactly one half of the norm in L 2 (— 7r,7r) for 
odd functions, and, for that matter, even functions). We interpret this to 
say that the system ^ n (t) = sin nt, n > 1, is a complete orthogonal system 
on the interval (0, n) (the orthogonality was pointed out in Example 5.9). 

In an analogous way, a function on (0,7r) can be extended to an even 
function and be approximated by the partial sums of a cosine series. This 
shows that the orthogonal system <p n (t) = cos nt, n > 0, is also complete 
on (0,7r). 

We see that a function on (0, 7r) can be represented by either a sine series 
or a cosine series, whichever is suitable, and both these series converge 
to the function in the norm of L 2 (0,7r). This turns out to be useful in 
applications to problems for differential equations. 

Remark. The reader may now be asking the following question: suppose that 
{c n }n € z is a sequence of numbers such that |c n | 2 converges. Does there then 
exist some / E L 2 ( T) having these numbers as its Fourier coefficients? The 
answer is yes — provided we admit functions that are Lebesgue-measurable but 
not necessarily Riemann-integrable. If we do this, we actually have a bijective 
mapping between L 2 (T) and the space / 2 (Z), so that / E L 2 ( T) corresponds to 
the sequence of its Fourier coefficients, considered as an element of Z 2 (Z). □ 

As an application of Parseval we can prove the following nice theorem. 
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Theorem 5.9 Let f £ L 2 (T). If the Fourier series of f is integrated term 
by term over a finite interval (a, 6), the series obtained is convergent with 
the sum f(t) dt. 

Note that we do not assume that the Fourier series of / is convergent in 
itself! 

Proof. Let us first assume that the interval (a, b) is contained within a 
period, for instance, — n < a < b < 7T. We define a function g on T by 
letting g(t) = 1 for a < t < b and 0 otherwise. We compute the Fourier 
coefficients of g: 





-e~ ina ), 0; 



If now f ~J2 c n e%nt > the polarized Parseval relation takes the form 



Iz [ f(t)g{t)dt = J2 C n 
2n ^ 



2nn 



(< l~ inb - e -ina^ _|_ Cq 



b — a 
2ir 



2n 



E< 

n^O 



Q inb 



- e" 



m 



+ co(b ■ 



But the integral on the left is nothing but f(t) dt divided by 27T, and the 
terms in the sum on the right are just what you get if you integrate c n e int 
from a to b. After multiplication by 2n we have the assertion for this case. 

If the interval (a, b ) is longer than 27 r, it can be subdivided into pieces, 
each of them shorter than 27 r, and then we can use the case just proved 
on each piece. When the results are added, the contributions from the 
subdivision points will cancel (convergent series can be added termwise!), 
and the result follows also in the general case. □ 

If we choose a = 0 and let the upper limit of integration be variable 
and equal to £, the theorem gives a formula for the primitive functions 
(antiderivatives) of /: 

F(t)= / f(u)du + K = y Cn — +c 0 t + AT, 

JO n*l ) m 



where K is some constant. We shall rewrite this constant; first notice that, 
by the Cauchy-Schwarz inequality for sums, 




< oo 
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which means that the series c n/(^ n ) converges absolutely with sum 

equal to some number K\. Write C = K — K\ and use the fact that 
convergent series can be added term by term, and we find that 



m = Y, 

rc^O 



Cn 

in 



e int + c 0 t + c. 



This is the simplest form of the “formally” integrated Fourier series of 
/, and we have thus shown that this integration is permitted and indeed 
results in a series convergent for all t. 

In general, the integrated series is no longer a Fourier series; this happens 
only if Co = 0, i.e., if / has mean value 0. 



Exercises 

5.14 Using the result of Exercise 4.13 on page 84, find the value of the sum 



£ 



1 

(4n 2 — l) 2 * 



5.15 We reconnect to Exercise 4.22 on page 90. By studying the series estab- 
lished there on the interval (0,7r/2), prove the formula 



£ 



k= 0 



(-i) fc 

(2k + l)((2fc + l) 2 — a 2 ) 



7 r 

4a 2 



1 

COS |q7T 



- 1 



at Z. 



5.16 Let / be a continuous real- valued function on 0 < x < tt such that /( 0) = 
/( 7r) = 0 and /' G L 2 (0,7r). 

(a) Prove that f*(f(x)) 2 dx< f* (/' (x)) 2 dx. 

(b) For what functions does equality hold? 

(Hint: extend / to an odd function on (— 7r,7r).) 



5.5 Legendre polynomials 

A number of “classical” ON systems in various L 2 spaces consist of poly- 
nomials. Polynomials are very practical functions because their values al- 
ways can be computed exactly using elementary operations (addition and 
multiplication), which makes them immediately accessible to computers. 
If you want a value of a function such as e x or cosx, the effective calcu- 
lation always has to be performed using some sort of approximation, and 
this approximation often consists of some polynomial (or a combination of 
polynomials). 

The success of such approximations depends fundamentally on the va- 
lidity of the following theorem. 
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Theorem 5.10 (The Weierstrass approximation theorem) An arbi- 
trary continuous function f on a compact interval K can be approximated 
uniformly arbitrarily well by polynomials . 

In greater detail, the assertion is the following: If K is compact, / : 
K — > C is continuous and £ is any positive number, then there exists a 
polynomial P(x) such that | f(pc) — P(x) \ < £ for all x E K. 

A proof can be conducted along the following lines (which are best un- 
derstood by a reader who is familiar with slightly more than the barest 
elements about power series expansions of analytic functions): By a linear 
change of variable, the interval can be assumed to be, say, K = [0,7r]. On 
this interval, / can be represented by a cosine series (which involves, if you 
like, considering / to be extended to an even function). The Fejer sums cr n 
of this series converge uniformly to /, according to the remark following 
Theorem 4.1. We can then choose n to make swp K |/(x) — a n (x) \ < e/2. 
The function a n is a finite linear combination of functions of the form 
cos kx. These can be developed in Maclaurin series, each converging to 
its cosine function uniformly on every compact set, in particular on K. 
Take partial sums of these series and construct a polynomial P, such that 
sup K |cr n (x) — P(x) | < e/2. Using the triangle inequality one sees that P 
is a polynomial with the required property. 

Now we make things concrete. Let the interval be K = [—1, 1], so that 
we live in the space L 2 (— 1, 1), where the inner product is given by 

(/»5) = J J{x)g{x)dx. 

If we orthogonalize the polynomials 1, x, x 2 , . . . , according to the Gram- 
Schmidt procedure with respect to this inner product, the result is a se- 
quence {P n } of polynomials of degree 0,1,2, — They are traditionally 
scaled by the condition P n (l) = 1. The polynomials obtained are called 
Legendre polynomials. The first few Legendre polynomials are 

P 0 (x) = 1, Pl(x) = X, P 2 (x) = ^(3x 2 - 1), P 3 {x) = ^(5x 3 - 3x). 

We notice a few simple facts that are easily seen to be universally valid. 
The polynomial P n has degree exactly n; for odd n it contains only odd 
powers of x and for even n only even powers of x. An arbitrary polynomial 
p(x) of degree n can, in a unique way, be written as a linear combination 
of P 0 ,...,P n , with the coefficient in front of P n different from zero. We 
illustrate this with the example p(x) = x 2 + 3x: 

x 2 + 3x = |(P 2 (x) + \) + 3Pi(x) = |P 2 (x) + 3Pi(x) + |P 0 (x). 

We saw in Sec. 5.3 that uniform convergence implies L 2 -convergence on 
bounded intervals. By Theorem 5.10, continuous functions can be uniformly 
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approximated by polynomials; these can be rewritten as linear combina- 
tions of Legendre polynomials, and thus a continuous function on [—1, 1] 
can be approximated arbitrarily well by such expressions in the sense of 
L? . Just as in the proof of Theorem 5.8 it follows that the Legendre poly- 
nomials make up a complete orthogonal system in L 2 (— 1, 1). For historical 
reasons, they are not normed; instead one has ||P n || 2 = 2/(2 n + 1) (see 
Exercise 5.17). 

The following so-called Rodrigues formula holds for the Legendre poly- 
nomials: 

W = ^C"((* 2 -ir). (5.4) 



Example 5.12. Find the polynomial p(x) of degree at most 4 that mini- 
mizes the value of 



fj 



|sin7rx — p(x)\ 2 dx. 



Solution. By the general theory, the required polynomial can be obtained 
as the orthogonal projection of f(x) = sin7nr onto the first five Legendre 
polynomials: 

Since / is an odd function, its inner products with even functions are zero. 
Thus the sum reduces to just two terms: 



p(x ) = 



if, Pi) 
(Pi, Pi) 



Pl(x) + 



(. f,Pz ) 
(Pz,Pz) 



Pz{x). 



The denominators are taken from a table, ( Pk,Pk ) — 2/(2 k + 1), and the 
numerators are computed (taking advantage of symmetries): 



(f,Pz) = 2 [ 

Jo 



(sin 7 nr) • x dx = 2 




x sin 7 tx dx = 



l 

\ (5x 3 — 3x) sin 7 rx dx = 



2t r 2 - 30 



2 

7 r 



Putting everything together, we arrive at 

525 - 35tt 2 



. , 315-15?r 2 

= 2j8 * ■ 



27T 3 



« 2.6923x-2.8956x 3 . 



In Figure 5.3 we can see the graphs of / and p. For comparison, we have 
also included the Taylor polynomial T(x) of degree 3, that approximates 
f(x) near x = 0. It is clear that T and p serve quite different purposes: T 
is a very good approximation when we are close to the origin, but quite 
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FIGURE 5.3. Solid line: /, dashed: p, dotted: T 

worthless away from that point, whereas p is a reasonable approximation 
over the whole interval. □ 

We have standardized the situation in this section by choosing the inter- 
val to be (—1,1). By a simple linear change of variable everything can be 
transferred to an arbitrary finite interval (a, b). Here, the polynomials Q n 
make up a complete orthogonal system, if we let 

and the norm is given by 

ll^"H 2 = / \Qn(x)\ 2 dx= ^ ^ ■ 

When solving problems, use the formula collection in Appendix C, page 
254. 

Exercises 

5.17 Show that the polynomials defined by (5.4) are orthogonal and that ||P n || 2 
2 

= . (Hint: write down (P m ,P n ), where m < n, and, integrating by 

2n + 1 

parts, move differentiations from P n to P m . It is also possible to keep track 
of the leading coefficients.) 
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5.18 Find the best approximations with polynomials of degree at most 3, in the 
sense of L 2 (— 1,1), to the functions (a) H(x) (the Heaviside function), 
(b) 1/(1 + x 2 ). Draw pictures! 

5.19 Compare the result of Exercise 5.4, page 110, with what is said in the text 
about Legendre polynomials on an interval (a, b). 

5.20 Let po(x) = 1, pi(x) = x, and define p n for n > 2 by the recursion formula 

(n + l)p n -|_i (x) = (2n + 1 )xp n (x) — np n -i(x) for n = 1,2, Prove that 

p n is the same as P n . 

5.21 *Prove that u(x) = P n (x), as defined by Rodrigues’ formula, satisfies the 
differential equation 

(1 — x 2 )u"(x) — 2x u(x) + n(n + 1) u(x) = 0. 



5.6 Other classical orthogonal polynomials 

In this section we collect data concerning some orthogonal systems that 
have been studied ever since the nineteenth century, because they occur, 
for instance, in the study of problems for differential equations. Proofs 
may sometimes be supplied by an interested reader; see the exercises at 
the end of the section. When solving problems in the field, a handbook 
containing the formulae should of course be consulted. A small collection 
of such formulae is found on page 254 f. 

First we consider the L 2 space on the semi-infinite interval (0, oo) with 
the weight function w(x) = e~ x , which means that the inner product is 
given by 

poo 

(f,9)= f(x)g(x)e~ x dx. 

Jo 

The Laguerre polynomials L n {x) can be defined by a so-called Rodrigues 
formula (where D denotes differentiation with respect to x): 

L n (x) = ^D n (x n e- x ). (5.5) 

It is not hard to see that L n is actually a polynomial of degree n; it is 
somewhat more laborious to check that (L m , L n ) = S mn ; indeed these poly- 
nomials are not only orthogonal but even normed. See Exercises 5.22-23. 

Next we take the interval to be the whole axis R and the weight to be 
w(x) = e~ x . Thus the inner product is 

{. f,9 ) = f f(x) g(x) e~ x2 dx. 

JR 

The Hermite polynomials H n (x) can be defined by 
H n {x) = (~l) n e x * D n (e~ x2 ). 



(5.6) 
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The facts that these functions are actually polynomials of degree equal to 
the index n and that they are orthogonal with respect to the considered 
inner product are left to the reader in Exercises 5.24-25. The polynomials 
are not normed; instead one has ||iJ n || 2 = n\ 2 n y/n. 

Finally, we return to a finite interval, taken to be (—1,1), and let the 
weight function be 1/ y/1 — x 2 : 



(/, 9 ) = J x /(*) g(x) • 

Orthogonal polynomials are defined by the formula T n (x) = cos (n arccos x). 
They are called Chebyshev polynomials (which can be spelled in various 
ways: Cebysev, Chebyshev, Tschebyschew, Tchebycheff, and HeSbimeB are 
a few variants seen in the literature, the last one being (more or less) 
the original). That the formula actually defines polynomials is most easily 
recognized after the change of variable x = cos 0, 0 < 6 < 7r, which gives 
the formula T n ( cos0) = cosn0, and it is well known that cos nO can be 
expressed as a polynomial in cos0. In the case n — 2, for example, one has 
cos 26 = 2 cos 2 6 — 1, which means that T 2 (x) = 2x 2 — 1. The orthogonality 
is proved by making the same change of variable in the integral (T m ,T n ). 
One also finds that ||To || 2 = n and ||T n || 2 = ir/2 for n > 0. 

It can be proved that the polynomials named after Laguerre, Hermite, 
and Chebyshev actually constitute complete orthogonal systems in their 
respective spaces. 

We round off with a couple of examples. 

Example 5.13. Find the polynomial p{x) of degree at most 2, that min- 
imizes the value of the integral 

rOO 

/ |x 3 - p(x)\ 2 e~ x dx. 

Jo 



Solution. The norm occurring in the problem belongs together with the 
Laguerre polynomials. These even happen to be orthonormal (not merely 
orthogonal), which means that the wanted polynomial must be 



p(x) = (/, L 0 ) L 0 (x) + (/, Li) Li(x) + (/, L 2 ) L 2 (x), 



where f(x) = x 3 . From a handbook we fetch Lo(x) = 1, L\{x) = 1 — x, 
L 2 (x) = 1 — 2x+ \ x 2 . When computing the inner products it is convenient 

to notice that / 0 °° x n e~ x dx = n\. We get 




x 3 e x dx — 3! = 6, 
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/»oo 

(f,L 1 )= / 

Jo 

rOO 

{fM)= / 

Jo 



x 3 (l - x ) e~ x dx = 3! - 4! = 6 - 24 = -18, 
z 3 (l - 2x + \ x 2 ) e~ x dx = 3! - 2 • 4! + \ ■ 5! = 18. 



Thus, p(x) = 6Lo(*)-18Li(a;)+18L2(a:) = 6— 18(1— a?)+18(l— 2ac+§ x 2 ) = 
6 — 18x + 9x 2 . □ 



Example 5.14. Let f(x) = \/l — x 2 for ,r < 1. Find the polynomial 
p(x), of degree at most 3, that minimizes the value of the integral 

/j/M-pM 



Solution. This inner product belongs with the Chebyshev polynomials. Be- 
cause of “odd-even” symmetry, these have the property that a polynomial 
of even index contains only terms of even degree, and similarly for odd 
indices. Since / is an even function and the inner product itself has “even” 
symmetry, the wanted polynomial will only contain terms of even degree: 

P(X) - (T 0 , T 0 ) T ° (X) + (T 2 ,T 2 ) T2(x)> 

The data required are taken from a handbook: To (sc) = 1, T 2 (sc) = 2sc 2 — 1, 
and the denominators are found above (and in the handbook), so all that 
remains to be computed are the numerators: 

</,T„) = 2/^.1.^-*, 

(/, T 2 ) —2 f V^-x 2 -(2x 2 - 1 ) • - ■ dx =2 f (2x 2 - 1 ) dx = -§. 
Jo V 1 - x 2 Jo 

Substituting we get 

p{x) = \ • 1 - ( 2a;2 - 1) = j^(6 - 8x 2 + 4) = (5 - Ax 2 ). 



□ 



Exercises 

5.22 Show that the formula (5.5) defines a polynomial of degree n. 

5.23 Show that the Laguerre polynomials are orthonormal. (Hint: the same as 
for Exercise 5.17.) 

5.24 Show that the formula (5.6) gives a polynomial of degree n. 

5.25 Show that the Hermite polynomials are orthogonal and ||# n || 2 = n\ 2 n 
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5.26 Expand e x ^ in a Laguerre series; i.e., determine the coefficients c n in the 
formula 

oo 

e x ^ 3 ~ ^ c n L n (x ), x > 0. 

n = 0 

(The formula J 0 °° e~ at t n dt = n\/a n+1 may come in handy.) 

2 

5.27 Let f(x) = e x ( H(x + 1) — H(x — 1)) Approximate / with a polynomial 
p(x) of degree at most 3 so as to minimize the expression 




\f(x)-p(x)\ 2 e 



.2 

dx. 



5.28 Let f(t ) = y/1 — t 2 for \t\ < 1. Find a polynomial p(t) of degree at most 3 
that minimizes 

i 2 vrb- 

5.29 Approximate f(x) = \x\ on the interval (—1,1) with a polynomial of degree 
< 3, first with the weight function 1 and secondly with weight l/\/l — x 2 . 
Thirdly , do the same with weight function (1 — x 2 ) (but here you’ll have to 
construct your own orthogonal polynomials). Compare the approximating 
polynomials obtained in the three cases. Draw pictures! Comment! 



Summary of Chapter 5 

In this chapter we studied vector spaces where the scalars are the complex 
numbers. Practically all results from real linear algebra remain valid in this 
case. The only important exception to this is the appearance of the inner 
product 

A typical example of an inner product space is given by the set L 2 (7, w), 
where I is an interval on the real axis and w is a weight function , i.e., a 
function such that w(x) > 0 on /; the inner product in L 2 (I,w) is defined 
by 

(/, g) = J f(x)g(x) w(x) dx. 

With the norm defined by \\u\\ = \J{u,u ), a notion of distance is given by 
\\u — v||. 

Theorem 

In an inner product space the inequalities 

\(u,v)\ < IMHMI, \\u + v|| < ||tz|| + \\v\\ 

are valid. 

With respect to an inner product, one defines orthogonality and orthonor- 
mal sets (or ON sets), as in the real case. Gram-Schmidt’s method can be 
used to construct such sets. 
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If {y>k}k=i ls an ON set, the orthogonal projection of a u £ V on to the 
subspace spanned by this set is the vector 

N 

P ( u ) = ' 52 ( U ’ t Pk)‘Pk- 
k = 1 



Theorem 

If {<^i, (f2, • . . , v?jv} is an ON basis in an iV-dimensional inner product space 

N 

V, then every u £ V can be written as u = ( u i V 3)^31 and furthermore 

3 = 1 



one has 



N 



|u|| 2 -E | (u, (fj ) | : 2 (theorem of Pythagoras) . 
3 = 1 



For the inner product of two vectors one also has the following formula: 



N 

(u,v) = Y J (u,<Pj)(v,<Pj). 

3 = 1 

Theorem 

Let {(Pk}k=i be an orthonormal set in an inner product space V and let u 
be a vector in V. Among all the linear combinations $ = J 2 k = 1 IkVk, the 
one that minimizes the value of || u — $|| is the orthogonal projection of u 
on to the subspace spanned by the ON set, i.e., $ = P(u). Also, it holds 
that 

N 

u-'}T(u,(p k )(p k 

k = 1 



N 



Sl<u>¥>fc)| 2 



k = 1 



Theorem 

If {(fkjkLi is an ON set in V and u £ V, then 

00 

|(u, (fk)\ 2 < N| 2 (Bessel’s inequality). 

fc= 1 



If every element in V can be approximated arbitrarily closely by linear 
combinations of the elements of {<£&}, then this set is said to be complete 
in V. 

Theorem 

The system {(fk}^ 1 is complete in V if and only if for every u £ V it holds 
that 

00 

IMI 2 = ^l( u ^fe)l 2 

k = 1 

(the Parseval formula or the completeness relation). 
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Theorem 

If the system {(pk}^ =1 is complete in V, then every u £ V can be written 
as u = YlkLi( u i where the series converges in the sense of the norm, 



i.e., 



u ~Ylk=i( u ^ ( Pk)(fk\\ -> 0 as iV — >> oo). 



Theorem 

If the system {(pk}™^ is complete in V, then 



oo 

(u,v) = 

k = 1 



for all u, v £ V . 

If the set {ifk} is not ON but merely orthogonal, all these formulae must 
be adjusted by dividing each occurrence of a ipk by its norm. 

Theorem 

The two orthogonal systems {e' int } ne z and {cosnt, n > 0; sin nt, n> 1} 
are each complete in L 2 (T). 

As a consequence of this, Parseval’s identities hold for ordinary Fourier 
series (with conventional notation): 



1 

27T 



L 



\m\ 2 dt='£ 

n€Z 



^ J T 9 ® dt = 



^ ^ d n 5 

nez 



i r 00 

- / |/(t)| 2 * = i|a 0 | 2 + 5; ( l a »l 2 + l h »l 2 ) ’ 

71 n=l 

l r 00 

- f{t)g(t)dt=\ao^+^'{a n a^ + b n 0 n ). 

* J T 



Theorem 

The Fourier series of a function / £ L 2 (T) can always be integrated term by 
term over any bounded interval (a, b). The series obtained by this operation 
is always convergent, regardless of the convergence of the original Fourier 
series. 

Theorem 

(The Weierstrass approximation theorem) An arbitrary continuous func- 
tion / on a compact interval K can be approximated uniformly arbitrarily 
well using polynomials. 



Historical notes 

The insight that certain notions of geometry, such as orthogonality and projec- 
tions, can be fruitfully applied to sets of functions dawned upon mathematicians 
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in various situations during the nineteenth century. Round the turn of the cen- 
tury, the Swedish mathematician I var Fredholm (1866-1927) treated certain 
problems for linear integral equations in a way that made obvious analogies with 
problems for systems of linear equations. Building on these ideas, the German 
David Hilbert (1862-1943) and the Pole Stefan Banach (1892-1945) intro- 
duced notions such as Hilbert and Banach spaces. The L p spaces mentioned in 
the present text are all Banach spaces (if one uses the Lebesgue integral in the 
definitions); and in particular L 2 spaces are Hilbert spaces. The latter spaces are 
infinite-dimensional, complex-scalar counterparts of ordinary Euclidean spaces, 
with a concept of distance that is coupled to an inner product. 

Parseval’s formula, which can be seen as a counterpart of the theorem of 
Pythagoras, is named after an obscure French amateur mathematician, Marc- 
Antoine Parseval des Chenes (1755-1836). 

Adrien-Marie Legendre (1752-1833) was an influential French mathemati- 
cian who worked in many areas. Edmond Laguerre (1834-86) and Charles 
Hermite (1822-1901) were also French. Hermit e is most famous for his proof 
that the number 7r is transcendental. Pafnuty Lvovich Chebyshev (1821-94) 
founded the great Russian mathematical tradition that lives on to this day. 



Problems for Chapter 5 



5.30 Determine the Fourier series of the function 



f(x) 



COSX, 0 < X < 7T, 

— cos X, —7 r < x < 0. 



72 2 

Also compute the sum S = 7— - 

(4n 2 - l) 2 



5.31 Let / be the even function with period 27 t described by f(x) = sin | x for 
0 < x < 7r. Using the Fourier series of /, find the values of the sums 



si 



00 00 

S2 = £ 



(-1 r 



4n 2 - 9 ’ ^ 4n 2 - 9 ’ S3 ^ (4 n 2 - 9) 2 ' 

n= 1 n= 1 71=1 






5.32 Use the result of Problem 4.46 on page 103 to compute the value of 

oo 

y 1 

^ (2 n - l) 6 



5.33 Use the result of Problem 4.48 on page 103 to compute the value of £(8). 

5.34 Find a polynomial p(x) of degree at most 1 that minimizes the integral 




— x 2 ) 2 (l + x) dx. 




134 5. L 2 Theory 



5.35 Let Q be the square {(x,y) : \x\ < 7r, |y| < 7r}, and let L 2 (Q ) denote the 
set of functions / : Q — > C that satisfy ff Q \f(x,y)j 2 dxdy < oo. In this 
space we define an inner product by the formula 



if >9) = 




f(x, y)g(x,y)dxdy. 



Q 



Define the functions tp mn € L 2 (Q) by <p m n(x,y) = e t ( mx + n v) j m ,n 6 Z. 
Show that these functions are orthogonal with respect to (•,•), and deter- 
mine their norms in L 2 (Q). 

5.36 Expand the function f(x) = e~ ax in a Fourier-Hermite series: 



f(x) ~ C n H n (x). 

n—0 



5.37 Expand f(x) = x 3 , x > 0, in a Fourier-Laguerre series: 

oo 

f(x) ~ ^ C„L n (x). 

n = 0 



5.38 Prove this formula for Legendre polynomials: 

(2 n + 1 )P„(x) = Pn +1 (x) - Pn- l(x), n > 1. 



5.39 A function f(x), defined on (—1, 1), can be expanded in a Fourier-Legendre 
series: 

oo 

f(x) ~ ^2 CnPn(x). 
n—0 

What does the Parseval formula look like for this expansion? 

5.40 Determine the distance in L 2 (— 1, 1) from sin to to the subspace spanned 
by 1, x , x 2 . (The distance is the norm of the residual.) 

5.41 The functions 1 and \/3 (2x — 1) constitute an orthogonal system in the 
space L 2 (0, 1). Find the linear combination of these that is the best ap- 
proximation of cos a; in L 2 (0, 1). 

5.42 / is continuous on the interval [0, 1]. Moreover, 

f f(x) x n dx = 0, n = 0,l,2,...,. 

Jo 

Prove that f(x) = 0 for all x in [0, 1]. 

5.43 Determine the coefficients a^, k = 0,1, 2, 3, so that the integral 




+ a\x + aix 2 + CL 3 X 3 — x 4 | 2 dx 



is made as small as possible. 
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5.44 Let f(x) = cos to. Let V be the space of continuous functions on the 
interval [—1,1] with the inner product 



(u,v 




u(x) v(x) dx. 



M is the subspace in V consisting of polynomials of degree at most 3. Find 
the orthogonal projection of / onto M. 

5.45 Determine the numbers a, b och c so as to make the expression 




2 



(a + bx + cx 2 ) 



dx 



as small as possible. 

5.46 Let f(x) = sgnx = 2H(x) — l for x E R. Approximate / with a third-degree 
polynomial in the sense of Hermite. 

5.47 Let f(x) ~ n - ~ 2 



(1 — x 2 ) 3 / 2 . Find a polynomial P(x) of degree at most 3 that 



minimizes 



/ 



1 |/(x)-P(x)| 

\/l — X 2 



dx. 




6 

Separation of variables 



6.1 The solution of Fourier’s problem 

We now return, at last, to the problem stated in Sec. 1.4: heat conduction 
in a rod of finite length, with its end points kept at temperature 0. The 
mathematical formulation of the problem was this: 

(E) u xx = u t , 0 < x < 7r, t> 0; 

(B) u( 0, t) = u(n, t) = 0, t > 0; (6.1) 

(I) u(x, 0) = /(#), 0 < X < 7T. 

We had found the following solutions of the homogeneous sub-problem 
consisting of the conditions (E) and (B): 

N 

u(x,t) = ^6 n e _n 1 sin nx. (6.2) 

n= 1 

Then we asked two questions: can we allow N -> oo in this sum? And can 
the coefficients be chosen so that (I) is also satisfied? Now we can answer 
these questions. 

Let f(x) be the initial values for 0 < x < ir. By defining f(x) = —f(—x) 
for —7 r < x < 0 we get an odd function. It can be expanded in a Fourier 
series, which is a sine series with coefficients 

2 r 

b n = — f(x)sinnxdx. 

ft Jo 
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The coefficients are bounded, \b n \ < M, they even tend to zero as n — )► oo. 
This implies that the series 

oo 

u(x,t) = b n e~ n 1 sin nx (6.3) 

71=1 

converges very nicely as soon as t > 0. If a > 0, we can estimate the terms 
like this for t > a : 

\b n e~ n 1 sinnx\ < M e~ n a < M e~ na = M n , 

and is a convergent geometric series. The considered series then 

converges uniformly in the region t > a. If it is differentiated termwise 
with respect to t once, or with respect to x twice, the new series are also 
uniformly convergent in the same region (check this!). According to the 
theorem on differentiation of series, the function u, defined by (6.3), is 
differentiable to the extent needed, and since all the partial sums satisfy 
(E)+(B), so will the sum. 

To check that the initial values are right is somewhat more tricky. If / 
happens to be so nice that ^2\b n \ < oo, then we are home; for in this case 
the series will actually converge uniformly in the closed set 0 < x < 7r, 
t > 0, and so the sum is continuous in this set, making 



oo 




(cf. Theorem 4.2, page 83). This holds, say, if the odd, 27r-periodic extension 
of / belongs to C 2 ; it even holds under weaker assumptions, but this is 
harder to prove. 

If / G L 2 (0,7r), we can alternatively study convergence in the L 2 sense. 
Let v t be the restriction of u to time £, i.e., Vt(x) = u(x,t), 0 < x < n 
(here the subscript t does not stand for a derivative). The function vt has 
Fourier coefficients b n e~ n *, and by Parseval we have 



p 7f CXJ 

INI 2 = / \vt{x)\ 2 dx = lY,\b n \ 2 e~ 2nH <^Y j \b n \ 2 = ||/|| 2 <oo. 

^ 71=1 ^ 71=1 

Thus, vt also belongs to L 2 (0,7r) for each t > 0. Now we investigate what 
happens if t \ 0: 



11/ “t*| 



7 r 



£|M 2 (l-e -" 2t ) 2 = *(*). 



71=1 



The series defining $>(t) converges uniformly on t > 0 and its terms are 
continuous functions of t. Thus $(t) is continuous on the right for t = 0, 
and 

lim$(t) = $(0) = 0, 

t\ o 
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which means that ||/ — v t \\ — > 0 as t decreases to 0. The solution u thus 
has the L 2 -limit /, which is our way of saying that 



lim 

t\o 



f 



Hx,t)-f(x) I 2 



dx — 0. 



The terms of the series representing the solution consist of sine functions, 
multiplied by exponentially decreasing factors. The higher the frequency of 
the sine factor, the faster does the term containing it tend to zero - small 
fluctuations in the temperature along the rod are faster to even out than 
fluctuations of longer period. As time goes by, the temperature of the entire 
rod will approach zero - which should be expected, considering the physical 
experiment that we have attempted to describe with our model. 

Remark. For t > 0, the series in (6.3) can actually be differentiated an in- 
definite number of times with respect to both variables. What happens to the 
term b n e~ n 1 sin nx when it is differentiated is that one or more factors n come 
out, that sin and cos may interchange and also the sign may change. But, for 
t > a > 0, the resulting term can always be estimated by an expression of the 
form Mn p e~ n t < M n p e~ na = Q n , and it is easy to see that ^2Q n < oo 
(apply the ratio test). We conclude that all functions such as (6.3) are indeed of 
class C °° ; they are “infinitely smooth.” □ 



Exercises 

6.1 Find the solution of Fourier’s problem when (a) f(x) = sin 3 x for 0 < x < 7r; 
(b) f(x) = cos3x for 0 < x < 7r. 

6.2 Find a solution to the following modified Fourier problem (heat conduction 
in a rod of length 1; a is a positive constant): 

u t = \u xx , 0 < x < 1, t > 0; 
a 2 

u(0,t) = u(l,t) = 0, t > 0; tt(x,0) = /(x), 0 < x < 1. 

6.2 Variations on Fourier’s theme 

In this section we perform some slight variations on the theme that has just 
been concluded. Later on in the chapter we shall indicate the possibility of 
more far-reaching variations. 

Example 6.1. Let us study the problem of heat conduction in a completely 
isolated rod, where there is no exchange of heat with the surroundings, not 
even at the end points. As before, the rod is represented by the interval 
[0, 7r] , and the temperature at the point x at time t is denoted by u(x,t). 
Within the mathematical model that gives rise to the heat equation, the 
flow of heat is assumed to run from warmer to colder areas in such a way 
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that the velocity of the flow is proportional to the gradient of the temper- 
ature (and having the opposite direction). The mathematical formulation 
of the condition that no heat shall flow past the end points is then that the 
gradient of the temperature be zero at these points; in the one-dimensional 
case this condition is simply 1^(0, £) = u x (n,t) = 0. If the temperature of 
the rod at time 0 is called /(re), we have the following problem: 

(E) u xx = uu 0 < x < 7r, t> 0; 

(B) u x ( 0, t) = u x ( 7r, t) = 0, t > 0; (6.4) 

(I) ^(x, 0) = f{x), 0 < X < 7T. 

This problem is largely similar to the previous one, and we attack it 
by the same means (cf. Sec. 1.4). Thus we start by looking for nontrivial 
solutions of the homogeneous sub-problem (E)+(B), and we try to find 
solutions having the form u(x,t) = X(x)T(t). Substituting into (E) leads, 
just as before, to the separated conditions 

X"(x) + \X(x) = 0, T'{t) + A T(t) = 0. 

To satisfy (B) without having u identically zero we must also have X'(0) = 
X f (n) = 0. This leaves us with the following boundary value problem for 
X: 



X"{x) + XX(x) = 0, 0 < £ < 7r; X'(0) = X'(v) = 0. (6.5) 

Just as in Sec. 1.4, we look through the different cases according to the 
value of A. It will be sufficient for us to give account of “basis vectors,” so 
we omit scalar factors that can always be adjoined. 

For all A < 0 one finds that the only possible solution is X(x) = 0 
(the reader should check this). If A = 0, the equation is X"(x) = 0 with 
solutions X(x) = A + Bx. The boundary conditions are satisfied if B = 0. 
This means that we have the solutions X(x) = Xq(x) = A = constant. For 
the same value of A, the T-equation also has the solutions T = constants; 
as a “basis vector” we can choose 

uo(x,t) =X 0 (x)T 0 (t) = (6.6) 

When A > 0 we can put A = a; 2 with u > 0. Thus we have X n + lj 2 X = 0 
with solutions X(x) = Acosujx + Bsinux and X'(x) = — uj A sin wx + 
ujB cos ux. The condition X'(0) = 0 directly gives B = 0, and then X f (n) = 
0 means that 0 = -aMsinu;7r. This can be satisfied with 4^0 precisely 
if u is a (positive) integer. Thus, for A = n 2 we have the solution X(x) = 
X n (x) = cos nx and multiples of this function. The corresponding equation 
for T is solved by T n (t) = e~ n2t . In addition to (6.6), the problem (E)+(B) 
thus has the solutions 



u n (x, t) = X n (x) T n (t) = e nH 



cosnx, n = 1,2,3, 
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By homogeneity, series of the form 



oo 

u(x, t) = | ao + ^ a n e~ n 1 cos nx (6.7) 

n=l 

are solutions of (E)+(B), provided they converge nicely enough. It remains 
to be seen if it is possible to choose the constants a n so that (I) can be 
satisfied. Direct substitution of t = 0 in the solution would give 

oo 

f(x) = u(x, 0) = ^ ao + ^ a n cos nx, 0 < x < n. 

n — 1 

We can see that if / is extended to an even function on the interval ( — 7r, 7 r) 
and we let the a n be the Fourier coefficients of this function, then we ought 
to have a solution to the whole problem. 

And, just as in the preceding section, everything works excellently if we 
know, for example, that \ a n\ < oo. 

It can be noted that the solution (6.7) has the property that all terms 
except for the first one tend rapidly to zero when t tends to infinity. One 
is left with the term | ao . As we have seen, this is equal to the mean value 
of /, and this is in accordance with the intuitive feeling for what ought to 
happen in the physical situation: a completely isolated rod will eventually 
assume a constant temperature, which is precisely the mean of the initial 
temperature distribution. □ 

Example 6.2. Let us now modify the original problem in a few different 
ways. We let the rod be the interval (0,2), and the end points are kept 
each at a constant temperature, but these are different at the two ends. 
To be specific, say that u(0,t) = 2 and u(2,£) = 5. Let us take the initial 
temperature to be given by f(x) = 1 — x 2 . The whole problem is 

(E) 'Uxx = 0 x 2, t 0; 

(B) u(0,t) = 2, u(2,t) = 5, t > 0; (6.8) 

(I) u(x, 0) = 1 — x 2 , 0 < x < 2. 

Here, separation of variables cannot be applied directly; an important 
feature of that method is making use of the homogeneity of the conditions, 
enabling us to add solutions to each other to obtain other solutions. For this 
reason, we now start by homogenizing the problem in the following way. 
Since the boundary values are constants, independent of time, it should be 
possible to write u(x,t) = v(x,t) + <p(x), where <p(x) should be chosen to 
make v the solution of a modified problem with homogeneous boundary 
conditions. Substitution into (E) gives 



Vxx(x,t) + tp"(x) = V t (x,t), 
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so it is desirable to have ip f, (x) = 0. If we can also achieve <£>(0) = 2 and 
tp( 2) = 5, we would get v(0,t) = v(2 , £) = 0. 

Thus we are faced with this simple problem for an ordinary differential 
equation: 

ip"(x) = 0; <p(0) = 2, <p( 2) = 5. 

The unique solution is easily found to be <p(x) = | x + 2. Substituting this 
into the initial condition of the original problem, we have 

1 - x 2 = u(x, 0) = v(x, 0) + ip(x) = v(x, 0) + | x + 2. 

We collect all the conditions to be satisfied by v: 

(E') v xx = v t , 0 < x < 2, t> 0; 

(B') v(0, t) = 0, v(2, t) = 0, t > 0; (6.9) 

(I') v(x, 0) = -x 2 — | x — 1, 0 < x < 2. 



This problem is essentially of the sort considered and solved in Sec. 1.4 and 
6.1. A slight difference is the fact that the length of the rod is 2 instead of 7 r, 
but the only consequence of this is that the sine functions in the solution will 
be adapted to this interval (as in Sec. 4.5). The reader is urged to perform 
all the steps that lead to the following formula for “general” solutions of 
(E'H(B'): 

oo 

v(x,t) = ^a n exp 

n— 1 

Next, the coefficients are adapted to (I'): 



n 2 7T 2 \ . nir 
— - — t sin — x. 
4/2 



2 f 2 , 2 3 1\ ■ n7r 

= 2 /o Sm ~2 



xdx 



16 (— 1 )" 



nn 



2 16(1 -(-!)”) 

I Q Q 



Finally, we put together the answer to the original problem: 



u(x,t) 



§*+ 2 +E 

n = 1 



16(— l) w — 2 16(1 - (-l) n ) 

nn n 3 7r 3 



n 2 7r 2 t/4 



. nn 
sm x. 
2 



As time goes by, the temperature along the rod will stabilize at the distri- 
bution given by the function ip(x). This is called the stationary distribution 
of the problem. □ 



Example 6.3. In our next variation we consider a rod with a built-in 
source of heat. The length of the rod is again 7r, and we assume that at the 
point with coordinate x there is generated an amount of heat per unit of 
time and unit of length along the rod, described by the function sin(x/2). 
It can be shown that this leads to the following modification of the heat 
equation: 

x 

n't — n xx + sin ~ , 



(E) 



0 < X < 7T, t > 0. 
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We also assume that both ends are kept at temperature 0 for t > 0 and 
that the initial temperature along the rod is 1: 

(B) u(0,t) = u(n,t) = 0, t > 0; (I) u(x, 0) = 1, 0 < x < tt. 

Here there is an inhomogeneity in the equation itself. We try to amend this 
by using the same trick as in Example 2: put u(x,t) = v(x,t) + p{x) and 
substitute into (E) and (B). (Do it!) We conclude that it would be very 
nice to have 

x 

<p"{x) = - sin <p(0) = (f(ir) = 0. 

The first condition implies that ip must be of the form ip(x) = 4sin(x/2) + 
Ax+B, and the boundary conditions force us to take B = 0 and A = —4/7 r. 
As a consequence, v shall be a solution of the problem 

(E') V xx = v t , 0 < X < 7r, t> 0; 

(B') v(0, t) = 0, v{i r, t) = 0, t> 0; (6.10) 

(!') v(x,0) = 1 — 4sin(x/2) + (4x)/tt, 0 < x < n. 



The reader is asked to complete the calculations; the answer is 



A n OO 

/ i\ . . x 4 2 

ulx, t) = 4 sm x H — > 

v ’ ' 9 7r t r 



n — 1 



1 — (— l) n (4n 2 — 5) _ n 2 t . 

—z — e n sm nx. 

n(4n 2 - 1) 



□ 



Example 6.4. We leave the heat equation and turn to the wave equation. 
We shall solve the problem of the vibrating string . 

Imagine a string (a violin string or guitar string), stretched between the 
points 0 and 7 r of an x-axis. The point with coordinate x at time t has 
a position deviating from the equilibrium by the amount u(x,t). If the 
string is homogeneous, its vibrations are small and considered to be at 
right angles to the x-axis, gravitation can be disregarded; and the units of 
mass, length, and time are suitably chosen, then the function u will satisfy 
the wave equation in the simple form u xx = u t t • The fact that the string 
is anchored at its ends means that u(0,t) = u(7r,t) = 0. At time t — 0, 
every point of the string is located at a certain position and has a certain 
speed of movement. We want to find u(x, t) for t > 0 and all the interesting 
values of x. This is collected into a problem of the following appearance: 



(E) 


u xx = utu 0 < x < 7r, t > 0; 




(B) 


7/(0, t) = u( 7r, t) = 0, t > 0; 


(6.11) 


(Ii) 


tfc(x, 0) = f(x), 0 < X < 7T, 


(I 2 ) 


ut(x, 0) = #(x), 0 < x < 7r; 




Again, (E) and (B) are 


homogeneous conditions. The usual attempt 


u{x,t) 



= X(x) T(t ) this time leads up to this set of coupled problems: 

f X"(x) + AX(x) = 0, 

\X(0) = X(7r) = 0; 



T"(t) + \T(t) =0. 
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The X problem is familiar by now: it has nontrivial solutions exactly for 
A = n 2 (n = 1, 2, 3, . . .), viz., multiples of X n (x) = sin nx. For these values 
of A, the T problem is solved by T n (t) = a n cos nt + b n sinnt. Because of 
homogeneity we obtain the following solutions of the sub-problem (E)-j-(B): 

oo oo 

u(x,t) = ' s ^2 / X n {x)T n (t) = ^(a n cosn£ + & n sinnt)sinnx. (6.12) 

71 = 1 71=1 

Letting t = 0 in order to investigate (Ii), we get 

oo 

f(x) — u(x, 0) = ^2 a n s i n nx - 

71=1 

Termwise differentiation with respect to t and then substitution of t = 0 
gives for the second initial condition (I2) that 

00 

g(x) = ut(x, 0) = y^ nb n sin nx. 

71=1 

Thus, if we choose a n to be the sine coefficients of (the odd extension of) 
/, and choose b n so that nb n are the corresponding coefficients of <7, then 
the series (6.12) ought to represent the wanted solution. 

As we saw already in Sec. 1.3, the wave equation may have rather irreg- 
ular, non-smooth solutions. This is reflected by the fact that the series in 
(6.12) can converge quite “badly.” See, for example, the solution of Exer- 
cise 6.7, which is, after all, an attempt at a quite natural situation. If we 
allow distributions as derivatives, as indicated in Sec. 2.6-7, the mathemat- 
ical troubles go away. It should also be borne in mind that the conditions 
of Exercise 6.7 are not physically realistic: a string does not really have 
thickness 0 and cannot really take on the shape of an angle. □ 

Remark. The typical term in the sum (6.12) can be rewritten in the form 
A n sm(nt + a n )smnx. Its musical significance is the nth partial tone in the 
sound emitted by the string. (The first partial is often called the fundamen- 
tal.) Figure 6.1 illustrates in principle the shapes of the string that correspond to 
different values of n. These are also called the modes of vibrations of the string. 
□ 

Remark. Of considerable musical importance is the fact that the nth partial 
also vibrates in time with a frequency that is the nth multiple of the fundamental. 
This is what was noted already by PYTHAGORAS: if the length of the string is 
halved (making it vibrate in the same manner as the whole string would vibrate 
in the second mode), one hears a note sounding one octave higher. The successive 
partials are illustrated in Figure 6.2. The accidental J? stands for lowering the pitch 

by slightly more than a (tempered) semi-tone, while J and jf indicate raising the 
pitch by slightly less or respectively more, than a semi-tone. Partial number 7 is 
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71=1 



71=2 



71=3 




71=4 



71=5 



FIGURE 6.1. 




71=6 




FIGURE 6.2. 

wellknown to musicians (especially brass players) as an ugly pitch that is to be 
avoided in normal music. Partials 11 and 13 are also bad approximations of the 
pitches indicated in the figure, but they are so high up that they cause relatively 
little trouble in normal playing. □ 

We round off this section with a problem for the Laplace equation in a 
square. This sort of problem is called a Dirichlet problem: the Laplace 
equation in a region of the plane, with values prescribed on the boundary 
of the region. 

Example 6.5. Find u(x,y) that solves u xx + u yy = 0, 0 < z < 7 r, 0< 
y < 7r, with boundary conditions u(x, 0) = sin 3# - 3sin2x for 0 < x < 7 r, 
u(x, 7r) = u(0, y) = u(ir, y) = 0, 0 < x, y < n. 

Solution . Draw a picture! We have a homogeneous equation , 

(E) 'U'xx “I” V"yy “ 

together with three homogeneous boundary conditions , 

(Bi, 2 , 3 ) ^(0, y) = u(n, y) = 0, u(x, tt) = 0, 
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and one non-homogeneous boundary condition , 

(B 4 ) u(x, 0) = sin 3x — 3 sin 2x. 

We begin by disregarding (B 4 ) and look for nontrivial functions u of the 
special form u(x,y) = X(x)Y(y) satisfying the homogeneous conditions. 
Substitution into (E) gives 

X"(x)Y(y) + X(x)Y"(y) = 0, 

which can be separated to look like 

X"(x) = Y"(y) 

X(x) Y(y) ’ 

and by the same argument as in preceding cases we conclude that the two 
sides of this equation must be constant. This constant is (again by force of 
tradition) given the name —A. The boundary conditions (B^) can be met 
by saying that X(0) = X ( 7 r) = 0, and (B 3 ) by putting Yfrj = 0. We find 
that we have the following couple of problems for X and Y: 

f *"(*) + AX(x) = 0 / Y"(y) - A Y(y) = 0 

\X(0)=X{7r) = 0 \F(7t) = 0 

The problem for X is, by now, wellknown. It has nontrivial solutions if 
and only if A = n 2 for n = 1, 2, 3, . . ., and these solutions are of the form 
X n (x) = sin nx. For the same values of A, the Y problem is solved by 
Y n (y) = Ae ny +Be~ ny , where A and B shall be chosen to meet the condition 
Y n (7r) = 0. This is done by letting B = —Ae 2nn . We thus have the solutions 

u n (x,y) = A n (e ny — e n ^ 2lT ~ y>) ) sinnx, n = 1,2,3, ... , 

of the homogeneous conditions (E) and (Bi ? 2 , 3 ). Because of the homogene- 
ity, sums of these solutions are again solutions. A “general” solution is given 
by 

00 

u(x,y) = A n ( e ny — e n ^ 27T ~ y ^) sinnx. 

n= 1 

We now have to choose the coefficients A n to meet the remaining condition 
(B 4 ). The reader should check the computations that lead to the final result 

u(x,y) = - — ~ 4 ^( e<2y ~ e 2 ( 2ir - y ^) sin 2x + - — — e 3 ^ 27r ~ y ^) sin 3a;. 



□ 

In the last example, the boundary condition was homogeneous on three of 
the edges of the square. A general Dirichlet problem for a square might be 
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taken care of by solving four problems of this kind, with non-homogeneous 
boundary values on one edge at a time, and adding the solutions. 

In the exercises the reader will have the opportunity to apply the basic 
ideas of the method of separation of variables to a variety of problems. In 
all cases, the success of the method is coupled to the fact that one reaches 
a problem for an ordinary differential equation together with boundary 
conditions. This problem turns out to have nontrivial solutions only for 
certain values of the “separation constant,” and these solutions are a sort of 
building blocks out of which the solutions are constructed. This sort of ODE 
problem is called a Sturm-Liouville problem and will be considered in 
Sec. 6.4 for its own sake. 

It is even possible to treat partial differential equations with more than 
two independent variables in much the same way. 

Exercises 

6.3 Find a solution of the heat problem Ut = u xx for0<x<7r,t>0, such 
that u x (0,t) = u x (n,t) = 0 for t > 0 and u(x, 0) = |(1 -f cos3x) for 
0 < X < 7T. 

6.4 Determine a solution of the problem 

{ u xx — tut, 0 < x < 7r, t > 1; 
u(0, t) = u(tt, t) = 0, t > 1, 
u(x, 1) = sin z + 2 sin 3x, 0 < x < 7r. 

6.5 Find a solution of the non-homogeneous heat conduction problem 

{ u xx ~ u t + sinx, 0 < x < 7T, t > 0; 
u( 0, t) = u(tt , t) = 0, t > 0; 
u(x, 0) = sinx + sin2x, 0 < x < it. 

6.6 Solve the following problem for the vibrating string: 

{ u xx = utt , 0 < x < 7r, t > 0; 

u(x, 0) = 3sin2x, ut(x, 0) = 5 sin 3a:, 0 < x < 7r; 

u( 0, t) = u(7r, t) = 0, t > 0. 

6.7 The plucked string: a point on the string is pulled from its resting position 
and then released with no initial speed. If the string is plucked at its middle 
point, what tones are heard? In a mathematical formulation: Solve the 
problem (6.11), when / is given by 

f(x) = ax , 0 < X < | 7T, f(x) = a(7T — x), | 7T < X < 7T, 

and g(x) = 0. 

6.8 Find u(x, t) if 

{ u xx (x,t ) = 0 < x < 1, t > 0; 

u(0, t) = u( 1, t) = 0, t > 0; 

u(x, 0) = sin37rx, u t (x, 0) = sin tocos 2 ttx, 0 < x < 1. 
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6.9 Find a solution of the following problem (one-dimensional heat conduction 
with loss of heat to the surrounding medium) for h > 0 constant: ut = 
u xx — hu , 0 < x < 7r, t > 0, together with u(0, t) — 0, u( 7r, t) = 1 for t > 0 
and u(x , 0) = 0 for 0 < x < tt. 

6.10 A Dirichlet problem: u xx + u yy = 0 for 0 < x, y < 1, u(x , 0) = 1) = 0, 

u(0, y) = 0 and u( 1, y) = sin 3 7n/. 

6.11 Find a solution u = u(x , t) of this problem: 

{ u xx + | u = u tl 0 < x < 7r, t > 0; 

u(0,£) = 0, u(7r,t) = 1, t > 0; 

0) = 0, 0 < X < 7T. 

6.3 The Dirichlet problem in the unit disk 

We shall study a problem for the Laplace equation in two dimensions. Let 
u = u(x,y ) be a function defined in an open, connected set in R 2 . The 
Laplace equation in Q, is 

d 2 u d 2 u . . 

Au: ~dx^ + df~ 0, 

The solutions of this equation are called harmonic functions in The 
Dirichlet problem is the task of finding all such functions with prescribed 
values on the boundary dQ. We shall study this problem in the case when 
is the unit disk D : x 2 + y 2 < 1, so that the boundary <9f) is the unit 
circle T : x 2 + y 2 = 1. Concisely, the problem is 

A u(x,y) = 0, (x,y) G D; u(x,y) = g(x,y) = known function, (x, y) G T. 

(6.13) 

We shall describe two lines of attack: first a method that requires know- 
ledge of the elementary theory of analytic (or holomorphic) functions, then 
a different approach involving separation of variables. 

Method 1. Interpret (x,y) as a complex number z = re l6 . The boundary 
function g can then conveniently be considered as a function of the polar 
coordinate 0, so that we are looking for harmonic functions u = u[z ) = 
u(re l9 ) for r < 1 with boundary values 

u(e ld ) = lim u(re l6 ) = g(9 ), —tt < 0 <it. 

The unit disk D is simply connected. By the theory of analytic functions, 
every harmonic function u in D has a conjugate-harmonic partner v such 
that the expression f(z ) = u{z) -f iv(z) is analytic in D. An analytic func- 
tion in the unit disk is the sum of a power series: 

oo oo 

u(z) + iv{z) = f(z) = A n z n = ^2 A rJ n e me 

71=0 71=0 
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oo 

= ^ (B n -f iC n ) r n (cos n6 + i sin n9) 

71=0 

OO 

= ^2 rTl ((Bn cos n6 — C n sin n6) + i(C n cos n6 + B n sin n9 )) , 

71=0 

where the A n are Taylor coefficients with real parts B n and imaginary parts 
C n . Taking the real part of the whole equation one sees that u{z ) must be 
representable by a series of the form 

00 

u(z) = u(re ld ) = ^2 rn (Bn cos nO - C n sin nQ) , 

71=0 

and, conversely, one realizes (by reading the equation backward) that all 
such series represent harmonic functions (provided the corresponding series 
Y^A n z n converges in D). 

In order to make the formula neater, we switch letters: put a n = B n and 
b n — — C n for n > 1, ao = 2£?o, and note that the value of Co is immaterial 
(since sin 06 = 0 for all 0), and we get 

00 

u(re ld ) = \ ao + ^ r n ( a n cos n9 + b n sin nO ) . 

71=1 



Method 2. We want to find solutions of the Laplace equation in the region 
described in polar coordinates by r < 1, —tv < 9 < tt. First we transform 
the equation into polar coordinates. Using the chain rule one finds that 
A u = 0 (for r > 0) is equivalent to 

d ( du\ d 2 u 

r siVYr) + am =(l 



(the computations required are usually carried through in calculus text- 
books as examples of the chain rule). We then proceed to find nontrivial 
solutions of the special form u(r,9) = ij(r)0(0), i.e., solutions that are 
products of one function of r and one function of 9. Substitution into the 
equation results in 

r ^ ( r W 9(0) ) + W = ° ^ r[rR')'& = —RQ". 



We divide by RQ and get 



1 



R r 



(rR’) 1 = 



©" 



The left-hand member in this equation is independent of 9 and the right- 
hand member is independent of r. Just as in Sec. 1.4 we can conclude that 




150 



6. Separation of variables 



both members must then be constant, and this constant value is called A. 
The situation splits into the two ordinary differential equations 

e" + A0 = O, r(rR')' — \R = 0. 

In addition, there are a couple of “boundary conditions” : in order that the 
function u = RQ be uniquely determined in D, the function 0(0) must 
have period 2n. Furthermore, since u shall have a finite value at the origin, 
we demand that R(r) have a finite limit i?(0+) as r \ 0 . 

We begin with the problem for 0: 

/ 0"(0) + A0(0) = 0, 

\ 0(0 + 2tt) = 0 ( 0 ) for all 0. 

As in Sec. 1.4, we work through the cases A < 0, A = 0 and A > 0. In 
the first case there are no periodic solutions (except for 0(0) = 0). In 
the case A = 0, all constant functions will do: 0o(0) = Aq. When A > 0, 
let A = u 2 with u > 0, and we find the solutions 0^(0) = A u coscj0 + 
Bu sinu;0. These have period 2ir precisely if a; is a positive integer: u = n, 

n = 1, 2, 3, Summarizing, we have found interesting solutions of the 0 

problem precisely when A = n 2 , n — 0,1,2, — 

For these A we solve the i?-problem. When A = 0, the equation becomes 

r(rR')' = 0 rR' = C «=» R' = j R = C\nr + D. 

The value R( 0+) exists only if C = 0. In this case we thus have a solution 
u = uo = 0o(0)i?o(O = Ao • D = constant. For reasons that will presently 
become evident we denote this constant by 

When A = n 2 with n > 0 we have a so-called Euler equation: 

r(rR f )' = n 2 R r 2 R" + rR' - n 2 R = 0. 

To solve it, we change the independent variable by putting r = e s , which 
results in 




which has the solutions R = Ce ns + De~ ns = Cr n + Dr~ n . We must take 
D = 0 to ascertain that R( 0+) exists. Piecing together with 0 we arrive 
at the solution 

u{re %e ) — u n (re td ) = 0 n (0) R n (r) = C n r n (A n cos n0 + B n sinn0) 

= r n ( a n cos nO + b n sin n6 ) . 

The Laplace equation is homogeneous. Assuming convergence for the 
series, the solutions of A u = 0 in the unit disk should be representable by 
series of the form 

oo 

u(re l9 ) = | ao + ^ r n (a n cos nO + b n sin nO) . 

n = 1 
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This is the same form for harmonic functions in D as obtained by “Method 
1” above. 

The formal solutions that we have obtained can, of course, also be written 
in “complex” form. Via Euler’s formulae we find 

u(r,0) = u(re ie ) = ^ c n r^e m(9 . (6-14) 

n<EZ 



Notice that the exponents on r have a modulus sign. 

Now we turn to the boundary condition. As r 1, we wish that the 
values of the solution approach a prescribed function g(9). For simplicity, we 
assume that g is continuous. Let the numbers c n be the Fourier coefficients 
of g: 

Cn = h j T g{d)e ~ ined6 ' 

and, using these coefficients, form the function u(r, 6) as in (6.14). By 
Lemma 4.1, there exists a number M such that \c n \ < M. Using the Weier- 
strass M - test we can easily conclude that the series defining u converges 
absolutely and uniformly in every inner closed circular disk r < r*o, where 
ro < 1, and this still holds after differentiations with respect to r as well as 
6. According to the theorem on differentiation of series, termwise differenti- 
ation is thus possible, and since each term of the series satisfies the Laplace 
equation and this equation is homogeneous, the sum function u will also 
satisfy the same equation. Now we turn to the boundary condition. The 
uniform convergence in r < ro allows us to interchange the order of sum 
and integral in the following formula: 



u{r,6) = ^2 c n rW e ind = 

nGZ 



S ’- 1 " 1 

tiG Z 





dt 




g(t) dt. 



The sum in brackets can be computed explicitly - it is made up of two 
geometric series: 



-l 



^ r l n le <na = ^ r _n e ins + ^ r n e ir ' 



ne z 



n=— oo 
oo 



71=0 



: r n e~ ins +Vr” e ins = — + 



n= 1 



71=0 



1 — re 1. 



(1 — re ts )re ls + 1 — re 
|1 — re is | 2 



1 — r 2 



1 _l_ r 2 _ 2r cos s * 
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We define the POISSON kernel to be the function 



Pr(s) = P(a, r) = 2- V" r-l" 1 e ins 

Z7T 

tiGZ 



1 1 — r 2 

2n 1 + r 2 — 2r cos s * 



This gives the following formula for u: 

u(r,0) = f P r (9 — t) g{t) dt = f P r (t) g(6 — t) dt. 

J T J T 

The Poisson kernel has some interesting properties: 



1. 

2 . 

3. 



P r (s) = P r (~s ) > 0 for r < 1, s E T. 
f T P r (s) ds = 1 for r < 1. 



If (J > 0, then 



lim 




ds = 0. 



The proofs of 1 and 2 are simple (2 follows by integrating the series term 
by term, which is legitimate). The property 3 can be shown thus: since 
P r (s) is decreasing as s goes from 0 to i r, we have P r {s) < P r (S) on the 
interval, and 

/*7T /»7T 

/ P r (s) ds < P r {6) / ds = (tt — S)P r (S) -> 0 as r 1. 
i(5 

This sort of properties of a collection of functions should be familiar to the 
reader. They actually amount to the fact that P r is a positive summation 
kernel of the kind studied in Sec. 2.4. The only difference is the fact that 
the present kernel is “numbered” by a variable r that tends to 1, instead 
of using an integer N tending to infinity. Theorem 2.1 can be used, and we 
get the result that we have constructed a solution of the Dirichlet problem 
with boundary values g{6) in the sense that 



lim u(r, 6) = g{6) 

r/*l 



at all points 0 where g is continuous. 

In addition, the solution is actually unique. This can be proved using 
a technique similar to that employed at the end of Sec. 4.2. First one 
proves that the problem with boundary values identically zero has only the 
solution identically zero, and then this is applied to the difference of two 
solutions corresponding to the same boundary values. 

Remark. We have here touched upon another method of summing series that 
may not be convergent. It is called Poisson or Abel summation. For a numerical 
series 0,71 ^ cons i s f s i n forming the function 

oo 

f(r) = 

71=0 



0 < r < 1. 
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If this function exists in the interval indicated, and if it has a limit as r /* 1, then 
this limit is called the Poisson or Abel sum of the series. It can be proved that this 
method sums a convergent series to its ordinary sum. It is also a quite powerful 
method: it is stronger than Cesaro summation in the sense that every Cesaro 
summable series is also Abel summable; and there exist series summable by Abel 
that are not summable by Cesaro, not even after any number of iterations. □ 

Example 6.6. Find a solution of the Dirichlet problem in the disk having 
boundary values u(l,0) = cos 40 — 1. Express the solution in rectangular 
coordinates! 

Solution. It is immediately seen that in polar coordinates the solution must 
be 

u(r, 0) = — 1 + r 4 cos 4 0. 

We rewrite the cosine to introduce cos and sin of the single value 0: 

u = — 1 + r 4 ( cos 2 20 — sin 2 26) 

= — 1 + r 4 ((cos 2 6 — sin 2 6) 2 — (2 sin 6 cos 6) 2 ) 

= -1 + r 4 (cos 4 6 — 2 cos 2 6 sin 2 6 + sin 4 6 — 4 sin 2 6 cos 2 6) 

= -1 + x 4 - 6 x 2 y 2 + y 4 . 



□ 



Exercises 

6.12 Find a solution of the Dirichlet problem in the unit disk such that u(e %e ) = 
2 + cos 30 + sin 40. 

6.13 Find a solution of the same problem such that u(x, y) = x 4 + y 4 for 
x 2 + y 2 = 1. 

6.14 Solve the Dirichlet problem with boundary values it(l, 0) = sin 3 0. 

6.15 Perform the details of the proof of the uniqueness of the solution of Dirich- 
let’s problem. 



6.4 Sturm-Liouville problems 

In our solutions of the problems in the preceding sections, a central role was 
played by a boundary value problem for an ordinary differential equation 
containing a parameter A. This problem proved to have nontrivial solu- 
tions for certain values of A, but these values had the character of being 
“exceptional” . The situation seems loosely similar to a kind of problem that 
the reader should have been faced with in a seemingly completely different 
context, namely linear algebra: eigenvalue problems for an operator or a 
matrix. We shall see that this similarity is really not loose at all! 
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We start with a few definitions. Let V be a space with an inner product 
(*,•). A linear mapping A, defined in some subspace Va of V and having its 
values in V, is called an operator on V. Notice that this definition is slightly 
different from the one that is common in the case of finite-dimensional 
spaces: we do not demand that the domain of definition of the operator be 

A 

the entire space V. We write V 2 Va — > V or A : Va -> V. The image 
of a vector u £ Va is written A(u) or, mostly, simply Au. 

Definition 6.1 An operator A : Va -A>V is said to be symmetric, if 

( Au , v ) = (u, Av) for all u, v £ Va . 

Example 6.7. Let V = L 2 ( T), Va = V nC 2 (T) and let A be the operator 
— D 2 , so that Au — —u". Since u £ C 2 (T), the image Au is a continuous 
function and thus belongs to V. We have 

(Au,v) = - f u'\x)^dx = -[u\x)^x)] n _ + [ u\x)V{x)dx 
n J T 

The integrated parts are zero, because all the functions are periodic and 
thus have the same values at — n and n. □ 

Definition 6.2 An operator A : Va —>V is said to have an eigenvalue A, 
if there exists a vector u £ Va such that u ^ 0 and Au = Xu. Such a vector 
u is called an eigenvector, more precisely, an eigenvector belonging to the 
eigenvalue A. The set of eigenvectors belonging to a particular eigenvalue 
A (together with the zero vector) make up the eigenspace belonging to A. 

Example 6.8. We return to the situation in Example 6.7. If u{x) = 
a cos nx + b sin nx , where a and b are arbitrary constants and n is an integer 
>0, then clearly Au = n 2 u. In this situation we have thus the eigenvalues 

A = 0, 1, 4, 9, For A = 0, the eigenspace has dimension 1 (it consists of 

the constant functions), for the other eigenvalues the dimension is 2. (The 
fact that this is the complete story of the eigenvalues of this operator was 
shown in Sec. 6.3, “Method 2.”) □ 

For symmetric operators on a finite-dimensional space there is a spectral 
theorem, which is a simple adjustment to the case of complex scalars of the 
theorem from real linear algebra: If A is a symmetric operator defined on all 
of C n (for example), then there is an orthogonal basis for C n , consisting 
of eigenvectors for A. The proof of this can be performed as a replica of 
the corresponding proof for the real case (if anything, the complex case is 
rather easier to do than a purely “real” proof). In infinite dimensions things 
are more complicated, but in many cases similar results do hold there as 
well. 

First we give a couple of simple results that do not depend on dimension. 
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Lemma 6.1 A symmetric operator has only real eigenvalues, and eigen- 
vectors corresponding to different eigenvalues are orthogonal 



Proof. Suppose that Au = Xu and Av = fiv, where u ^ 0 and v ^ 0. Then 
we can write 



\(u,v) = (A u,v) = ( Au,v ) = (u,Av) = {u,pv) = fi{u,v). (6.15) 

First, choose v = u, so that also /i = A, and we have that A||ri|| 2 = A||ig|| 2 . 
Because of u ^ 0 we conclude that A = A, and thus A is real. It follows 
that all eigenvalues must be real. But then we can return to (6.15) with the 
information that fi is also real, and thus (A — fi) (u, v) = 0. If now A — p ^ 0, 
then we must have that (u, v) =0, which proves the second assertion. □ 

Regrettably, it is not easy to prove in general that there are “sufficiently 
many” eigenvectors (to make it possible to construct a “basis,” as in finite 
dimensions). We shall here mention something about one situation where 
this does hold, the study of which was initiated by Sturm and Liouville 
during the nineteenth century. As special cases of this situation we shall 
recognize some of the boundary value problems studied in this text, starting 
in Sec. 1.4. 

We settle on a compact interval I = [a, b\. Let p E C' 1 (/) be a real- valued 
function such that p(a) ^ 0 ^ p(6); let q E C(I) be another real- valued 
function; and let w E C(I) be a positive function on the same interval 
(i.e., w(x) > 0 for x E I). We are going to study the ordinary differential 
equation 



(E) (pu f ) f + qu + A wu = 0 

+ u + A w(x)u{x) = 0, x E I. 

Here, A is a parameter and u the “unknown” function. Furthermore, we 
shall consider boundary conditions , initially of the form 

(B) Aou(a) + A\u f (a) = 0, Bou(b) + B\u' (b) = 0. 



Here, Aj and Bj are real constants such that (Ao, A\) ^ (0,0) ^ (Ho, Hi). 

Remark. If we take p(x) = w(x) = 1, q(x) = 0, Ao = Ho = 1 and Ai = Hi = 0, 
we recover the problem studied in Sec. 1.4. □ 

The problem (E)+(B) is called a regular Sturm-Liouville problem. We 
introduce the space L 2 (I , w), where w is the function occurring in (E). This 
means that we have an inner product 



(u,v) = j u(x)v(x)w(x) 



dx. 



In particular, all functions u E C(I) will belong to L 2 (I,w), since the 
interval is compact. 
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We define an operator A by the formula 

An = - — ((pu'Y + qu), 
w v 7 

Va = {u G C 2 (/) : Au G L 2 (I,w) and it satisfies (B)}. 

Then, (E) can be written simply as Au = Xu. The problem of finding non- 
trivial solutions of the problem (E)+(B) has been rephrased as the problem 
of finding eigenvectors of the operator A. (The fact that Va is a linear space 
is a consequence of the homogeneity of the boundary conditions.) 

The symmetry of A can be shown as a slightly more complicated parallel 
of Example 6.7 above. On the one hand, 



o b 

(Au, v ) = ~ J ~ {iP^Y + Q u ) vwdx = -J ((pu'Y + qu)v dx 

a a 

b b b 

= — j (pu')'vdx — j quvdx = — [p tx 7 tj] ^ + J (pu'v' — quv) dx. 

a a a 

On the other hand (using the fact that p, q and w are real- valued) , 
b b b 

(u,Av) = J u- ((pv'Y qv)w dx = — J u(pv'Y dx — J uqvdx 

a a a 

b 

= —[ u Pv'] b a 4- J \u' pv' — uqv) dx. 



We see that 



(Au, v ) — (it, Av) = [puv f — pu'v] k = 

But the determinant in this expression, for x = a, must be zero: indeed, we 
assume that both u and v satisfy the boundary condition (B) at a, which 
means that 

Aou(a) + Aiu'(a) = 0, 

Aov(a) + Aiv'(a) = 0. 

This can be considered to be a homogeneous linear system of equations 
with (the real numbers) Aq and A\ as unknowns, and it has a nontrivial 
solution (since we assume that (A 0 ,Ai) ^ (0,0)). Thus the determinant is 
zero. In the same way it follows that the determinant is zero at x = b. We 
conclude then that 



p(x) 



u(x) u'(x) 



v(x) v f (x) 



(Au,v) = (u,Av), 
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so that A is symmetric. 

In this case, the symmetry is achieved by the fact that a certain substi- 
tution of values results in zero at each end of the interval. Clearly, this is 
not necessary. An operator can be symmetric for other reasons, too. We 
shall not delve deeper into this in this text, but refer the reader to texts 
on ordinary differential equations. 

For the case we have sketched above, the following result holds. 

Theorem 6.1 (Sturm— Liouville’s theorem) The operator A, belong- 
ing to the problem (E)+(B), has infinitely many eigenvalues, which can 
be arranged in an increasing sequence: 

Ai < A2 < A3 < • • • , where A n — >• 00 as n — > 00. 

The eigenspace of each eigenvalue has dimension 1 , and if (p n is an eigen- 
vector corresponding to X n , then {(Pn}%Li is a complete orthogonal system 
in L 2 (I,w). 

This can be rewritten to refer directly to the differential equation prob- 
lem: 

Theorem 6.2 The problem (E)+(B) has solutions for an infinite num- 
ber of values of the parameter X, which can be arranged in an increasing 
sequence: 



Ai < A2 < A3 < • • • , where X n 00 as n 00. 

For each of these values of X, the solutions make up a one- dimensional 
space, and if <p n is a non-zero solution corresponding to X n , the set {(p n }^L 1 
is a complete orthogonal system in L 2 (I,w). 

Proofs can be found in texts on ordinary differential equations. 

It is of considerable interest that one gets a complete orthogonal system. 
We already know this to be true in a couple of special cases. First we have 
the problem 

u"(x) + A u(x) = 0, 0 < x < 7r; tx(0) = u(tt) = 0, (6.16) 

that we first met already in Sec. 1.4; here the eigenfunctions are ip n (x) = 
sin nx, and according to Sec. 5.4 these are complete in L 2 (0, tt) (with weight 
function 1). Secondly, we have seen this problem, treated in Example 6.1 
of Sec. 6.2: 

u"(x) + A u(x) = 0, 0 < x < 7r; ^'(O) = u' (n) = 0. (6-17) 

There we found the eigenfunctions <po{x) = \ and y? n (x ) = cos nx, and we 
have seen that they are also complete in L 2 (0,7 t). 
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In the exercises, the reader is invited to investigate a few more problems 
that fall within the conditions of Theorem 6.2. 

If the assumptions are changed, the results may deviate from those of 
Theorem 6.2. We have already seen this in Examples 6.7 and 6.8 of the 
present section. There, we studied the operator — D 2 on T, which corre- 
sponds to the problem 

u n (x) + A u(x) = 0, —7 r < x < 7r; u(— 7t) = u( 7r), u f (— n) = 

The boundary conditions are of a different character from (B): they mean 
that u and u' have periodic extensions with period 2tt (so that they can 
truly be considered to be functions on the unit circle T). They are also 
commonly called periodic boundary conditions. (In contrast, the conditions 
considered in (B) are said to be separated: the values at a and b have no 
connection with each other.) In this case the eigenspaces (except for one) 
have dimension 2. If we choose orthogonal bases in each of the eigenspaces 
and pool all these together, the result is again a complete system in the 
relevant space, which is L 2 (T). 

Yet another few examples are given in the next section. In one of these 
cases it happens that the function p goes to zero at the ends of the compact 
interval; in others the interval is no longer compact. It can be finite, but 
open, and one or more of the functions p, q , and w may have singularities at 
the ends; the interval may also be a half-axis or even the entire real line. All 
these situations give rise to what are known as singular Sturm-Liouville 
problems, and they sometimes occur when treating classical situations for 
partial differential equations. 

Exercises 

6.16 Determine a complete orthogonal system in L 2 (0, 7 r) consisting of solutions 
of the problem 

u"(x) + A u{x) = 0, 0 < x < 7r; u(0) = u jjr) = 0. 

6.17 The same problem, but with boundary conditions 

u( 0) = u( 7r) -I- u (n) = 0. 



6.18 Show that the problem 



d_ 

dx 




+ 



A 

\/l — x 2 



u(x) = 0, 



-1 < X < 1 



has the eigenvalues A = n 2 (n = 0, 1, 2, . . .) and eigenfunctions T n (x) = 
cos(narccosx) for A = n 2 . (You are not expected to prove that these are 
all the eigenvalues and eigenfunctions of the problem.) 
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6.5 Some singular Sturm-Liouville problems 



Some celebrated problems in classical physics lead up to problems for or- 
dinary differential equations that are similar to the problems considered 
in the preceding section. We review some of these problems here, partly 
because of their historical interest, but also because they have solutions 
that are polynomials that we met in Sec. 5.5-6. 

The Legendre polynomials are solutions of the following singular Sturm- 
Liouville problem. Let I = [—1,1] and study the problem 

((1 - x 2 ) u'(x)) + A u(x) = 0, — 1 < x < 1, 

with no boundary conditions at all (except that u(x) should be defined 
in the closed interval). Here we can identify p(x) = 1 — x 2 , q{x) = 0 and 
w(x) = 1. Since p(x) = 0 at both ends of the interval, the corresponding 
operator A will be symmetric if one takes T>a = {u G C 2 (I) : Au G L 2 (J)} 
(the reader should check this, which is not difficult). It can be proved that 
this problem has eigenvalues A = n(n + 1), n = 0, 1,2, . . ., and that the 
eigenfunctions are actually (multiples of) the Legendre polynomials. 

Remark. The origin of this problem is the three-dimensional Laplace equation 
in spherical coordinates (r,0,0), defined implicitly by 

{ x = r sin 0 cos 0 r > 0, 

y = r sin <f> sin 6 0 < (ft < 7r, 

Z = r COS <f —IT < 9 < 7T. 

In these coordinates, the equation takes the form 

. _ d 2 u 2 du 1 d 2 u cot 4> du 1 d 2 u _ 

U dr 2 r dr r 2 d4> 2 r 2 d<t> + r 2 sin 2 0 d6 2 

This can also be written as 

r(ru ) + -J— (sin (ft uA + - ~o ~ uee = 0. (6.18) 

A function f(x,y,z) is said to be homogeneous of degree n, if f(tx,ty,tz) = 
t n f{x,y,z) for t > 0. This means that / is completely determined by its values 
on, say, the unit sphere, so that it can be written /(x,y, z) = r n g((f),6) for a 
certain function g. We now look for solutions u n of (6.18) that are homogeneous 
of degree n; these solutions are called spherical harmonics. Write 



u n (x,y,z) = r n S n (</>,0), 

where S n is called a spherical surface harmonic. Substitution into (6.18) and 
subsequent division by r n gives 



(n + l^SVi + 



1 d 
sin (j) dcf) 



( • a 9S A 



1 d 2 S n 

sin 2 <f) d0 2 



= 0 . 
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Now we specialize once more and restrict ourselves to solutions S n that are in- 
dependent of 9 ; denote them by Z n (</>). The equation reduces to 

(n + 1 )nZ n + “t— 7 -jt (sin^^p-^ = 0- (6.19) 

sm q> a<p\ acp ) 

Finally, we put x = cos 0 and P n (x) = Z n ((j)). The reader is asked (in Exer- 
cise 6.19) to check that the equation ends up as 

(1 - x 2 )P"(x) - 2 xP\x) + n(n + 1 )P n (x) = 0. (6.20) 

This is the Legendre equation. □ 

The Laguerre polynomials are solutions of the following singular Sturm- 
Liouville problem. Take I = [0, oo[, p(x) = x e ~ x , q(x) = 0 and w(x) = e~ x . 
The differential equation is 

-f- (x e~ x u' (x)) + Xe~ x u(x) = 0 
ax v 7 

x u n (x) + (1 — x) u'(x) + A u{x) = 0, x > 0, 

and the “boundary conditions” are that it(0) shall exist (of course) and 
that u(x)/x m shall tend to 0 as x — ► oo for some number m. (The latter 
condition can be phrased thus: u(x) is majorized by some power of x, as 
x — > oo, or u(x) “increases at most like a polynomial”.) The eigenvalues of 
this problem are A = n = 0,l,2,..., and the Laguerre polynomials L n are 
eigenfunctions. 

The Hermite polynomials come from the following singular Sturm-Liou- 
ville problem. On I = R one studies the equation 

e~ x u'(x))+\e~ x u(x) = 0 

ax v 

with the “boundary condition” that the solutions are to satisfy u(x)/x rn —> 
0 as \x\ — >• oo for some m > 0. Eigenvalues are the numbers A = 2 n, 
n = 0, 1, 2, . . . and the Hermite polynomials H n are eigenfunctions. 

Exercise 

6.19 Check that the change of variable x = cos (j) does transform the equation 
(6.19) into (6.20). 



Summary of Chapter 6 

The Method of Separation of Variables 

Given a linear partial differential equation of order 2, with independent vari- 
ables (x, y) and unknown function u(x,y ), together with boundary and/or 
initial conditions 




Historical notes 161 



1. If necessary (and possible), homogenize the equation and as many as 
possible of the other conditions. 

2. Look for solutions of the homogeneous sub-problem having the par- 
ticular form u(x,y ) = X(x)Y(y). This normally leads to a Sturm- 
Liouville problem, and the result should be a sequence of solutions 

== V ) ~ X n (x')Yn(y ') , Tl — 1, 2, 3, . . .. 

3. The homogeneous problem has the “general” solution u = £)c n ifc n , 
where the c n are constants. 

4. Adapt the constants c n to make the solutions satisfy also the non- 
homogeneous conditions. 

5. If you began by homogenizing the problem, don’t forget to re-adapt 
the solution to suit the original problem. 

Definition 

Assume p,q real, w > 0 on an interval I = [a, 6 ]. Then the following is a 
regular Sturm-Liouville problem on I: 

f (pu'Y + qu + Xwu = 0 
\ Aou(a) + Aiu f {a) = 0, B^u{b) + B\v!{b) = 0 

With the Sturm-Liouville problem we associate the operator A , defined for 
functions u that satisfy the boundary conditions by the formula 

Au = - — ({pu'Y -|- qu ) . 



Theorem 

(The Sturm-Liouville theorem) The operator A, belonging to the Sturm- 
Liouville problem, has infinitely many eigenvalues, which can be arranged 
in an increasing sequence: 

Ai < A 2 < A 3 < • • • , where A n — » 00 as n — > 00 . 

The eigenspace of each eigenvalue has dimension 1, and if <p n is an eigen- 
vector corresponding to A n , then {(p n }% Li is a complete orthogonal system 
in L 2 (/, w). 

Formulae for orthogonal polynomials are found on page 254 f. 



Historical notes 

Jacques Charles Franqois Sturm (1803-55) and Joseph Liouville (1809- 
82) both worked in Paris. Sturm was chiefly concerned with differential equations; 
Liouville also did remarkable work in the field of analytic functions and the theory 
of numbers. 
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Problems for Chapter 6 

6.20 Using separation of variables, solve the problem u t = u xx , 0<x<l,£>0; 
u( 0, t) = 1, u( 1, t) = 3, u(x, 0) = 2x + 1 — sin 2ttx. 

u xx = u t t + 2 u t , 0 < x < 7T, t > 0; 

6.21 Find a solution of the problem u( 0, t) = u( 7 r, t) = 0, t > 0; 

^ u(x, 0) = 0, ut(x , 0) = sin 3 x, 0 < a? < 7r. 

6.22 Find a solution of the differential equation u xx = u t + u, 0 < x < 7r, t > 0, 
that satisfies the boundary conditions u(0,t) = u(7r,t) = 0, t > 0, and 
u(x, 0) = x(tt — x), 0 < x < 7T. 

6.23 Find a function u(x, t) such that 

u t = 4 u xx , 0 < x < 4, t > 0; 

< u(0,£) = 10, u(4, £) = 50, £ > 0; 

^ u(x , 0) = 30, 0 < x < 4. 

6.24 Find a solution of the following problem: 

u xx = utt, 0 < x < 7r, £ > 0; 
u(0, £ ) = u(7r, £) = 0, £ > 0; 

u(x, 0) = x(7T — x), 0 < x < 7r; 

ut(x, 0) = sin 2x, 0 < x < it. 

6.25 Determine a solution of the boundary value problem 

f 

u xx + u yy = x, 0 < x < 1, 0 < 2/ < 1; 

< u(x, 0) = u(x, 1) = 0, 0 < x < 1; 

^ u(0, y) = u(l, 2 /) = 0, 0 < y < 1. 

6.26 Find a solution in the form of a series to the Dirichlet problem 

U xx “h Uyy — 0, x ~\ - y lj 

u(x,y) = \x\, x 2 + y 2 = 1. 

6.27 Solve the following problem for the two-dimensional Laplace equation 

u xx + Uyy =0, 0<X<7T, 0 <y <7T] 

< u x (0,y) = u x (7T,y) = 0, 0 < y < 7r; 

u(x, 0) = sin 2 x, u(x,7r) = 0, 0 < x < ir. 

6.28 Find a function u(x, £) such that 

Ut — u xx + cosx, 0 < x < 7r, £ > 0; 

u x (0, £) = u x (7r, £) = 0, £ > 0; u(x, 0) = cos 2 x + 2 cos 4 x, 0 < x < tt. 
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6.29 Solve the following problem for a modified wave equation: 

U xx = Utt -f 2 Ut, 0 < x < 7r, t > 0; 

u(0,t) = u(7T,t) =0, t > 0, 

u(x,0) = sin x + sin 3a:, ut (#, 0) = 0, 0 < x < 7r. 

6.30 Find u = u(x , t) that satisfies the equation = ut + tu, 0 < x < 7r, 

t > 0, with boundary conditions ii(0, t) = w(7r, t) = 0 for t > 0 and initial 

condition u(x , 0) = sin 2a:, 0 < x < tt. 

6.31 Find a bounded solution of the problem for a vibrating beam: 

/ 

Utt Uxxxx — 0, 0 < x < 7r, t > 0; 

< u(0,t) = u(7T,t) = Uxx(0,t) = Uxx(7T,t) = 0, t > 0; 

u(x, 0) = x(tT — x), Ut(x, 0) = 0, 0 < X < 7T. 

6.32 A (very much) simplified model of a nuclear reactor is given by the problem 



J u t = Au xx + Bu , 0 < x < /, t > 0; 

u(0, t) = u(l,t) = 0, t > 0. 

Here, u is the concentration of neutrons, while A and B are positive con- 
stants. The term Au xx describes the scattering of neutrons by diffusion, 
and the term Bu the creation of neutrons through fission. Prove that there 
is a critical value L of the length l such that if l > L, then there exist 
unbounded solutions; whereas if l < L, then all solutions are bounded. 

6.33 In order to get good tone quality from a piano, it is desirable to have 
vibrations rich in overtones. An exception is the seventh partial, which 
results in musical dissonance, and should thus be kept low. Under certain 
idealizations the vibrations of a piano string are described by 



Utt = Uxx , 0 < X < 7T, t > 0; 

u(0, t) = u(7T, t) = 0, t > 0; 

/■ 

l//i, for a < x < a + h, 



u(x, 0) = 0, u t (x,0) = 



0 



otherwise. 



Here a describes the point of impact of the hammer; and h, the width of 
the hammer, is a small number. 

(a) In the form of a series, compute the limit of u(x, t) as h \ 0. 

(b) Where should the point a be located so as to eliminate the seventh 
partial tone? There are a number of possible answers. Which would you 
choose? Explain why! 
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Fourier transforms 



7.1 Introduction 



Suppose that / is piecewise continuous on [— P, P] (and periodic with period 
2 P). For the “complex” Fourier series of / we have 

oo 

f(t) ~ c « exp ( in p *) ’ (7-1) 

n=— oo 

where i fj’ . n 

Cn= 2P J P ^ 6XP \~ in P V dt ( 7 - 2 ) 

One might say that / is represented by a (formal) sum of oscillations with 
frequencies nn/P and complex amplitudes c n . 

Now imagine that P —>> oo, and we want to find a corresponding represen- 
tation of functions defined on the whole real axis (without being periodic). 
We define, provisionally, 

f{P,v) = J p f(t)e- iut dt, we R, (7.3) 



so that c n = — /(P, mr/P). The formula (7.1) is translated into 

Zi 



m 



i 

2 P 



E /to 



^71 



AuJnt 



27T 






U7T 



= 



(7.4) 
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Because of Au n = uo n +i — u n = — , this last sum looks rather like a 
Riemann sum. Now we let P -» oo in (7.3) and define 



/M = lim f(P,u) 

P — yoo 



/: 



f(t)e- luJt dt, ueR 



(7.5) 



(at this point we disregard all details concerning convergence). If (7.4) had 
contained f(uj n ) instead of /(P,a; n ), the limiting process P — » oo would 
have resulted in 

1 7°° - 

(7 - 6) 

The formula couple (7.5) + (7.6) actually will prove to be the desired coun- 
terpart of Fourier series for functions defined on all of R. Our strategy will 
be the following. Placing suitable conditions on /, we let (7.5) define a 
new function /, called the Fourier transform of /. We then investigate the 
properties of / and show that the formula (7.6) with a suitable interpreta- 
tion (and under certain additional conditions on /) constitutes a means of 
recovering / from /. 

Loosely speaking this means that while a function defined on a finite 
interval (such as (— P, P)) can be constructed as a sum of harmonic os- 
cillations with discrete frequencies {u) n = n7r/P : n G Z}, a function on 
the infinite interval ] — oo,oo[ demands a continuous frequency spectrum 
{u : u £ R}, and the sum is replaced by an integral 



7.2 Definition of the Fourier transform 

Assume that / is a function on R, such that the (improper) integral 

[ |/(i)| dt = f \f(t)\dt (7.7) 

J — oo J R 

is convergent; using the notation introduced in Chapter 5, this is the same 
as saying that / G L X (R). In practice we shall only encounter functions 
that are piecewise continuous, i.e., they are continuous apart from possibly 
a finite number of finite jumps in every finite sub-interval of R. For such 
an /, the following integral converges absolutely, and for every real w its 
value is some complex number: 

/M= [ me-^dt. (7.8) 

J R 

Definition 7.1 The function f, defined by (7.8), is called the Fourier 
transform or Fourier integral of f. 
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Common notations, besides /, are T[f] and F (“capital letter = the 
transform of lower-case letter”). In this connection, it is useful to work on 
two distinct real axes: one where / is defined, and the variable is called 
such things as t, x, y; and one where the transforms live and the^variable is 
lj, £, A, etc. We denote the former axis by R and the latter by R. 

Example 7.1. If /(£) = e”^, t £ R, then / £ L 1 ( R), and 

p p oo p 0 

7(w)= / e-ltle-*-* d* = / e - (1+i&,)t dt+ / dt 

Jr Jo J- oo 

1 1 _ 2 

1 + io; 1 — iu 1 + u; 2 ’ 

which can be summarized in the formula 

- ih- 

□ 



Example 7.2. Let f(t) = 1 for \t\ < 1, = 0 for |t| > 1 (i.e., f(t) — 
H(t + 1) — H(t — 1), where H is the Heaviside function as in Sec. 2.6). Then 
clearly / £ L 1 (R), and 

2 e iuJ — e~ iu 2sinw 

— : — = it. = , 

— ioj t c oh uo 




For u = 0 one has e lujt = 1, so that /( 0) = 2 = lim f(u). □ 

The fact noticed at the end of the last example is not accidental. It is a 
case of (b) in the following theorem. 



Theorem 7.1 Iff £ L x { R), the following holds for the Fourier transform 
/• 

(a) / is bounded; more precisely, \f(w)\ < / \f(t)\dt. 

^ ^ Jn 

(b) / is continuous on R. 

(c) lim f{u) = 0. 



Proof (a) follows immediately from the estimate ip(t) dt | < f f \<p(t) \ dt , 
which holds for any interval I and any Riemann-integrable function <p (even 
if it is complex- valued) . 

(b) is more complicated, and we leave the proof as an exercise (see Ex- 
ercise 7.3). 

(c) is a case of the Riemann-Lebesgue Lemma (Theorem 2.2, page 25). 

□ 

When dealing with Fourier series on the interval ( — 7r, 7 t), we have made 
use of special formulae in the case when the functions happen to be even or 
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odd. Something similar can be done for Fourier transforms. For example, 
if / is even , so that f(—t) = f(t), we have 

^ /*oo 

f( u ) = 7T~ / f{t)(cosout — ismout)dt = 2 / f(t)cosoutdt, (7.9) 

J.tt Jo 

from which is seen that f is real if / is real, and that f is even (because 
cos(-out) = cos out). Similarly, for an odd function g, g(— t) = — g(t), it 
holds that 

poo 

g(ou) = — 2i / g(t)smoutdt; (7.10) 

Jo 

if g is real, then g is purely imaginary, and furthermore g is odd. 

Integrals such as those in (7.9) and (7.10) are sometimes called cosine 
and sine transforms. 

Exercises 

7.1 Compute the Fourier transforms of the following functions, if they exist: 

(a) f(t) = t if \t\ < 1, = 0 otherwise. 

(b) f(t) = 1 — |t| if |t| < 1, = 0 otherwise. 

(c) f(t) = sin t. 

(d) f(t) = l/(t-i). 

(e) f(t ) = (sin t)(H(t + 7r) — H(t — n)) ( H is the Heaviside function). 

(f ) /(*) = (cos 7rt) (H (t + \ ) - H (t - | )) . 

7.2 Find the Fourier transforms of f(t ) = e -t i/(t) and g(t) = e*(l — H(t)). 

7.3 A proof of the assertion (b) in Theorem 7.1 can be accomplished along the 
following lines: 

(i) Prove that \ f(ou + h) - f(ou)\ <2 f R \f(t)\ |sin(l/it) | dt. 

(ii) Approximate / by a function g which is zero outside some bounded 
interval, as in the last step of the proof of Theorem 2.2, and use that 
| sin 1 1 < |t|. The proof even gives the result that / is uniformly continuous 
on R. 



7.3 Properties 

In this section we mention some properties of Fourier transforms that are 
useful when applying them to, say, differential equations. 

Theorem 7.2 The mapping T : f ^ f is a linear map from the space 
L 1 (R) to the space Co(R) of those continuous functions defined on R that 
tend to 0 at ±oo. 

Proof. The fact that / G Co(R) is the content of (b) and (c) in Theorem 7.1. 
The linearity of T means just that 

r\f + g]= m + T[g] , T[\ f] = A T[f] 
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when /, g G L^R) and A is a scalar (i.e., a complex number). This is an 
immediate consequence of the definition. □ 

Theorem 7.3 Suppose that f G T 1 (R) and let a be a real number. Then 
the translated function f a (t) = f(t — a) and the function e mt /(£) also belong 
to L 1 ( R), and 

fM = ?\f(t - a)]M = (7.11) 

F{e iat fmu) = f(u;-a). (7.12) 

Proof. For the first formula, start with f a {w) = f R f(t - a)e~' lUJt dt. The 
change of variable t—a = y gives the result. The proof of the second formula 
is maybe even simpler. □ 

These results are often called the delay rule and the damping rule for 
Fourier transforms. Notice the pleasant mathematical symmetry of the for- 
mulae. A similar symmetry holds for the next set of formulae. 

Theorem 7.4 Suppose that f is differentiable and that both f and f be- 
long to L l ( R). Then 

(Df) (w) = f'(u) = T[f] {u) = iwf{w). (7.13) 

If both f(t) and tf(t ) belong to L 1 (R), then f is differentiable, and 

(7.14) 

The proof of (7.13) relies, in principle, on integration by parts: 

/ oo poo 

/'(fie-™ dt = [f^e-™}^ - / f(t)(~iu ,)e~™ dt, 

and one has to prove that the integrated part is zero. We omit the details, 
which are somewhat technical; even though / G T 1 (R), it does not neces- 
sarily have to tend to zero in a simple way as the variable tends to ±oo. 
The second formula can be proved using some theorem on differentiation 
under the integral sign. Indeed, if this operation is permissible, we will have 

= (^/«)e-“<#) =i / R £(/(*)<=-“)<* 




We shall immediately use Theorem 7.4 to find the Fourier transform of 
the function f(t) = e ~ l 2 / 2 . 

Differentiating, we get /'(£) = — te~ l / 2 , and we see that 

f(t)+tf(t)=0. 
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It is easy to see that the assumptions for Theorem 7.4, both formulae, are 
fulfilled. Transformation gives iuf(u) + if f (uj) = 0 or, after division by i, 

/'M +w/(w) = 0 . 



Thus, / satisfies the same differential equation as /. The general solution of 
this equation is easily determined, for instance, using an integrating factor: 
y' + ty = 0 gives y = Ce - * 2 / 2 , C = y( 0). Thus, f(uS) = Ce - ^ 2 / 2 , where 

C = /(0) = [ f(t)e~ l0t dt = f e~ l / 2 dt = y/2n. 

J R 

Summarizing, we have found that 



T 



(a;) = V^e 



Theorem 7.4 implies that Fourier transformation converts differentiation 
into an algebraic operation. This hints at the possibility of using Fourier 
transformation for solving differential equations, in a way that is analogous 
to the use of the Laplace transform. The usefulness of this idea is, how- 
ever (at our present standpoint), somewhat limited, because the Fourier 
integral has problems with its own convergence. For example, the common 
homogeneous ordinary linear differential equations with constant coeffi- 
cients cannot be treated at all: all solutions of this sort of equation consist 
of linear combinations and products of functions of the types cos at , sin at , 
e bt , and polynomials in the variable t. The only function of these types that 
belongs to L 1 (R) is the function that is identically zero. 

There are, however, categories of problems that can be treated. Later in 
this chapter, we shall attack some problems for the heat equation and the 
Laplace equation with Fourier transforms. Also, the introduction of distri- 
butions has widened the range of functions that have Fourier transforms. 
We shall have a glimpse of this in Sec. 7.11, and a fuller treatment is found 
in Chapter 8. 



Exercises 

7.4 Assume that a is a real number / 0 and that / E L 1 (R). Let g(t) = f(at). 
Express <7 in terms of /. 

7.5 Find the Fourier transform of (a) f(t ) = cost, (b) g(t) = e~ ^ sint. 

7.6 If / is defined as in Exercise 7.1 (b), page 168, then /'(t) = 0 for \t\ > 1, 
/'(£) = 1 for — 1 < t < 0, /'(t) = — 1 for 0 < t < 1. Compute F[f'] in two 
ways, on the one hand using Theorem 7.4 and on the other hand by direct 
computation. 

Remark. The fact that f'(t) fails to exist at some points evidently does 
not destroy the validity of (7.13). But / must be continuous. 
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7.7 Find /(w) if f(t) = (a) t e _ * 2/2 , (b) e _(t2+2t) . 

Hint for (b): complete the square, combine formula (7.11), the example fol- 
lowing Theorem 7.4 and Exercise 7.4. Another way of solving the problem 
is indicated in the remark below. 

7.8 Suppose that /(£) has Fourier transform /( to) = e~ u . Determine the trans- 
forms of f( 2 t), f( 2 t + 1), f( 2 t + 1) e lt . 

7.9 Does there exist an / G T X (R) such that f(w) = 1 — cos a;? 

7.10 Suppose that / has the Fourier transform /. Find the transforms of 
f(t) cos at and f(t) cos 2 at ( a real ^ 0). 

Remark on Exercise 7.7: Problems such as 7.7 (b) and 7.8 can also be solved by 
writing out the Fourier integral, then rewriting it and changing variables so as to 
reshape the integral into a recognizable transform. For example, 



T[e~^ + 2 t) ](u;) = / e _ ( t2 + 2t + 1) + 1 e ~ iujt dt = e e~ (t+1)2 ( 

J R J R 



t + 1 = 2//V2, ] 

t = -7= - 1 > = — / e ~v 2 / 2 e — Mv/V2— 1) ( 

v2 I v/2 /„ 

dt — dy/\/2 J 



[ e -y2/2 e _iya,/ ' /2 dy. 
v2 



The last integral is the Fourier transform of e * / 2 , computed at the point cj/\/2, 
which means v^expT— 1 ^ = y/27re~ UJ ^ 4 . The answer to the problem is 



' e- (t2+2t) ]H - e ~— • ^e - 2/4 - v^e 1+ ^-^ 2 . 



7.4 The inversion theorem 

We now formulate the result that constitutes our promised precise version 
of the formula (7.6) on page 166. 

Theorem 7.5 (Inversion theorem) Suppose that f G ^(R), that f is 
continuous except for a finite number of finite jumps in any finite interval, 
and that f(t) = | (/(£+) + f(t — )) for all t. Then 

/(f 0 ) = lim T [ /(w) e luJt ° du) (7.15) 

A -± oo Z7T J_ A 

for every to where f has (generalized) left and right derivatives. In partic- 
ular, if f is piecewise smooth (i.e., continuous and with a piecewise contin- 
uous derivative) , then the formula holds for all to G R. 
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The result, and also the proof, is very similar to the convergence theorem 
for Fourier series. Just as for these series, the convergence properties depend 
essentially on the local behavior of f(t) for t near £ 0 - To accomplish the 
proof we need an auxiliary lemma. 

Lemma 7.1 



f°° sin Au 7 r 

/ du = — for A > 0. 

Jo u 2 J 

It is easy to check, by the change of variable Au = t , that the integral is 
independent of A (if A > 0), so one can just as well assume that A = 1. The 
integral is not absolutely convergent, and J 0 °° in this case stands for the 

limit of f Q as X oo. There is no quite simple way to compute it. One 
method is using the calculus of residues, and textbooks on complex analysis 
usually contain precisely this integral as an example of that technique. 

Another attempt could build on the idea that 1 ju — / 0 °° e~ ux dx, which 
might be substituted into the integral: 




There is, however, a difficulty here: the double integral is not absolutely 
convergent (the integrand is too large when x is close to 0), which makes 
it hard to justify the change of order of integration. However, we will not 
delve deeper into this problem. 

Proof of Theorem 7.5. Put 



s(t 0 , A) = f- J f(tj}) e ltouJ du 

and rewrite this expression by inserting the definition of f(uj): 
s(to,A) = f K f A (/°° /(*) e _ia,i di) e iwt ° dw 

uj=A 

dt 

u)=—A 

*J- oo t 0 -t - 7T J —oq J " / U — 

Switching the order of integration is permitted, because the improper dou- 
ble integral is absolutely convergent over the strip (t, u) e R x [— A , A], and 



1 roo nS i -j po 



m 



_ 1 [°° sinA{t 0 -t) 1 r 



f(+ , 



e lU){t 0 -t ) 

j(t 0 - 1)_ 
sin Au , 

mi 
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in the last step we have put to~t = u. We are now in a situation very much 
the same as in the proof of the convergence of Fourier series; but there is a 
complication inasmuch as the interval of integration is unbounded. Using 
the lemma we can write 



2 f ,/ N sin Au 7 N 2 f , t sinAu _ 

-/ f(to-u) du-f(t 0 -) = - {f(to-u)-f(t 0 -)) du. 

n Jo u ^ Jo u 

(7.16) 

Now let s > 0 be given. Since we have assumed that / £ L 1 (R), there 
exists a number X such that 



| /(to - u)| du < £. 



Changing the variable, we find that 
f°° sinAu , f°° g 



0 as A 



The last integral in (7.16) can be split into three terms: 



/(to -u)- /(to-) 



sin Au du -\ — 
7 r 



poo 

/ /(to 
JX 



v sm Au _ 

u) du 

u 



sin Au 



du = I\ + I 2 — / 3 - 



The term Is tends to zero as A — > 00 because of (7.17). The term I 2 can 
be estimated: 

. _ . 2 f°° sin Au . 2 f°° 

\h\= - / f(to~u) du < - / \f(t 0 -u)\du<£. 

ft Jx u ft Jx 

In the term I\ we have the function u g(u ) = (/(to — u) — f(to))/(—u). 

This is continuous except for jumps in the interval (0,X), and it has the 
finite limit g(0+) = f' L {t 0 ) as u \ 0; this means that g is bounded and thus 
integrable on the interval. By the Riemann-Lebesgue lemma, we conclude 
that Ii 0 as A —»> 00 . All this together gives, since e can be taken as 
small as we wish, 



ft Jo u 



f (to - ) as A 00 . 



A parallel argument implies that the corresponding integral over (— 00 , 0) 
tends to /(to+). Taking the mean value of these two results, we have com- 
pleted the proof of the theorem. □ 

Remark. If Jg |/(o;)| du is convergent, i.e., / 6 T 1 (R), then (7.15) can be written 
as the absolutely convergent integral 



/(M = h L 



/He" 
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but in general one ha s to make do with the symmetric limit in (7.15). □ 



Example 7.3. For f(t) = e W we have f((j) = - ~ . Since / is piece- 

1 + or 

wise smooth it follows that 



e-l*l = 1 lim / 

7T A-+oc J_ A 



e iuji 
1 -\-LU 2 



du. 



In this case / happens to be absolutely integrable, and we can write simply 



e 




e iut 

1 + u; 2 



duj. 



We can switch letters in this formula — t and u are exchanged for each other 
— and then we also change the sign of cj, and we get (after multiplication 
by 7 r) the formula 



7T 6 



-M 



L 



e-iojt 

1 + t 2 



dt. 



In this way we have found the Fourier transform of 1/(1 + £ 2 ), which is 
rather difficult to reach by other methods: 



T 



1 

1 + t 2 



(a;) = 7r e 



□ 



Example 7.4. For the function / in Example 7.2 (page 167) we have 

cv x 2 sin a; _ _ . . . . . . . 

j{uj) = . In this case, the inversion integral is not absolutely conver- 



UJ 



gent. The theorem here says that 



lim - 
A— too 7 T 




sin a; 

UJ 



e iujt duo = 



1 as \t\ < 1, 

< | as t — ±1, 
0 as \t\ > 1. 



□ 



Example 7.5. When using a table of Fourier transforms (such as on page 
252 f. of this book), one can make use of the evident symmetry properties 
of the transform itself and the inversion formula in order to transform 
functions that are found “on the wrong side of the table.” We have actually 
seen an instance of this idea in Example 7.3 above. As a further example, 
suppose that a table contains an item like this: 



f(t) 7m 



te-W 



—4 iu> 

(1+w 2 ) 2 
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From this one can find the transform of the function 



9{t) = 



—4 it 
(1 + t 2 ) 2 



by performing two steps: 

1. Switch sides in the table, switching variables at the same time: 



—4 it 
(1 + t 2 ) 2 



uje 



2 . 



Multiply the right-hand side by 2ir and change the sign of the variable 
there: 



—4 it 
(1 + t 2 ) 2 



2iruje I ^ 



This is now a true entry in the table. It may be an aesthetic gain to divide 
it by —4 i to get 



t 

(1 + t 2 ) 2 






□ 



Example 7.6. When working with even or odd functions, the Fourier 
transform can be rewritten as a so-called cosine or sine integral, respectively 
(see p. 168). In these cases, the inversion formula can also be rewritten so 
as to contain a cosine or a sine, instead of a complex exponential. Indeed, 
if g is even, one gets the following couple of formulae: 



roo ^ r ° ° 

g(u) = 2 g(t) cos wtdt, g(t) = - g{> 

Jo K Jo 

and if h is odd, it looks like this: 

— i r°°~ 

h(uj) = — 2i / h{t) smut dt, h(t) = — / h(uo) si 
Jo K Jo 



uo) cos too duo , 



sin tu duo. 



(The reader should check this.) In applied literature, one often meets these 
“cosine” and “sine” transforms with slightly modified definitions. □ 



Remark. In the literature one can find many variations of the definition of the 
Fourier transform. We have chosen the conventions illustrated by the formula 
couple 





